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(57) Abstract 

This invention relates to an isolated nucleic acid 
fragment encoding a cellulose synthase. The invention also 
relates to the construction of a chimeric gene encoding all 
or a portion of the cellulose synthase, in sense or an ti sense 
orientation, wherein expression of the chimeric gene results 
in production of altered levels of the cellulose synthase in a 
transformed host cell. 
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TITLE 

PLANT CELLULOSE SYNTHASES 
This application claims the benefit of U.S. Provisional Application No. 60/092,844. 
filed July 14, 1998. 
5 FffiLP OF THE JNVENTIQN 

This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding cellulose biosynthetic enzymes in 
plants and seeds. 

BACKGROUN D OF THE INVENTION 
10 Cellulose is a major component of plant fiber, e.g. cotton fiber. Cellulose is 

composed of crystalline beta-1.4-glucan microfibrils (see World Patent Publication 
No. WO 98/00549). These microfibrils are strong and can resist enzymatic and mechanical 
degradation and are important in determining nutritional quality of animal and human 
foodstuffs. Hence, modification of the biosynthetic pathway responsible for cellulose 
15 synthesis through modification of cellulose synthase activity could potentially alter fiber 
quantity, either by producing more or less fiber in a particular plant species or in a specific 
organ or tissue of a particular plant. Modification of cellulose synthase activity could 
increase the value of the fiber to the end-user and may improve the structural integrity of the 
plant cell wall. Lastly, because cellulose is a major cell wall component, inhibition of 
20 cellulose synthesis would probably be lethal. Thus, cellulose synthase may serve as the 
target for a novel class of herbicides. Plant cellulose synthase genes, homologs of the 
bacterial celA genes encoding the catalytic subunit of cellulose synthase, have been reported 
from cotton. Arabidopsis. rice and alfala (World Patent Publication Nos. WO 98/00549 and 
WO 98/18949). 

25 There is a great deal of interest in identifying the genes that encode proteins involved 

in cellulose synthesis. These genes may be used in plant cells to control the synthesis of 
cellulose. Accordingly, the availability of nucleic acid sequences encoding all or a portion 
of a cellulose synthase would facilitate studies to better understand cellulose synthesis in 
p lants and provide genetic tools to alter cellulose production. 

30 SUMMARY OF THE INVENTION 

The instant invention relates to isolated nucleic acid fragments encoding cellulose 
biosynthesis enzymes. Specifically, this invention concerns an isolated nucleic acid 
fragment encoding a cellulose synthase and an isolated nucleic acid fragment that is 
substantially similar to an isolated nucleic acid fragment encoding a cellulose synthase. In 

35 addition, this invention relates to a nucleic acid fragment that is complementary to the 

nucleic acid fragment encoding cellulose synthase. An additional embodiment of the instant 
invention pertains to a polypeptide encoding all or a substantial portion of a cellulose 
synthase. 
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In another embodiment, the instant invention relates to a chimeric gene encoding a 
cellulose synthase, or to a chimeric gene that comprises a nucleic acid fragment that is 
complementary to a nucleic acid fragment encoding a cellulose synthase, operably linked to 
suitable regulatory sequences, wherein expression of the chimeric gene results in production 
5 of levels of the encoded protein in a transformed host cell that is altered (i.e., increased or 
decreased) from the level produced in an untransformed host cell. 

In a further embodiment, the instant invention concerns a transformed host cell 
comprising in its genome a chimeric gene encoding a cellulose synthase, operably linked to 
suitable regulatory sequences. Expression of the chimeric gene results in production of 

10 altered levels of the encoded protein in the transformed host cell. The transformed host cell 
can be of eukaryotic or prokaryotic origin, and include cells derived from higher plants and 
microorganisms. The invention also includes transformed plants that arise from transformed 
host cells of higher plants, and seeds derived from such transformed plants. 

An additional embodiment of the instant invention concerns a method of altering the 

15 level of expression of a cellulose synthase in a transformed host cell comprising: 
a) transforming a host cell with a chimeric gene comprising a nucleic acid fragment 
encoding a cellulose synthase; and b) growing the transformed host cell under conditions 
that are suitable for expression of the chimeric gene wherein expression of the chimeric gene 
results in production of altered levels of cellulose synthase in the transformed host cell. 

20 An addition embodiment of the instant invention concerns a method for obtaining a 

nucleic acid fragment encoding all or a substantial portion of an amino acid sequence 
encoding a cellulose synthase. 

A further embodiment of the instant invention is a method for evaluating at least one 
compound for its ability to inhibit the activity of a cellulose synthase, the method comprising 

25 the steps of: (a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding a cellulose synthase, operably linked to suitable regulatory sequences; 
(b) growing the transformed host cell under conditions that are suitable for expression of the 
chimeric gene wherein expression of the chimeric gene results in production of cellulose 
synthase in the transformed host cell; (c) optionally purifying the cellulose synthase 

30 expressed by the transformed host cell; (d) treating the cellulose synthase with a compound 
to be tested; and (e) comparing the activity of the cellulose synthase that has been treated 
with a test compound to the activity of an untreated cellulose synthase, thereby selecting 
compounds with potential for inhibitory activity. 

BRIEF DESCRIPTION OF THE 

35 DRAWINGS AND SEQUENCE DESCRIPTIONS 

The invention can be more fully understood from the following detailed description 
and the accompanying drawings and Sequence Listing which form a part of this application. 
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Figure 1 shows a comparison of the amino acid sequences set forth in SEQ ID NOs:2, 
4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana sequences (SEQ ID NOs:23 
(gi 2827139), 24 (gi 2827141), 26 (gi 4467125), 27 (gi 4886756) and 29 (gi 3 13561 1)) and 
Gossypium hirsutum sequences (SEQ ID NOs:25 (gi 1706958) and 28 (gi 5081779)). 
5 Table 1 lists the polypeptides that are described herein, the designation of the cDNA 

clones that comprise the nucleic acid fragments encoding polypeptides representing all or a 
substantial portion of these polypeptides, and the corresponding identifier (SEQ ID NO:) as 
used in the attached Sequence Listing. The sequence descriptions and Sequence Listing 
attached hereto comply with the rules governing nucleotide and/or amino acid sequence 
10 disclosures in patent applications as set forth in 3 7 C.F.R. § 1 .82 1 - 1 .825. 
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Cellulose Biosynthetic Enzymes 




Protein 


Clonp Dp^icmiitirm 
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SEQ ID NO: 
(Nucleotide) (Amino Acid) 


Cellulose Synthase 


bshl.pk0002.f6 


1 


2 


v_euuiose oymnase 


Contig composed of: 
ccoln.pk0005.g3 
cdt2c.pk002.gl 
cdt2c.pk002.116 
csclc.pk002.il 
p0031.ccmar05rb 
p0110.cgsma57r 


3 


4 


Cellulose Synthase 


crln.pk0135.el0 


5 


6 


Cellulose Synthase 


p0097.cqradl7rc 


7 


8 


Cellulose Synthase 


p0122.ckamh70rc 


9 


10 


Cellulose Synthase 


rlr24.pk0073.gl 


11 


12 


Cellulose Synthase 


sdp2c.pk005.o22 


13 


14 


Cellulose Synthase 


ses8w.pk0028.f3 


15 


16 


Cellulose Synthase 


ssl.pk0036.cl0 


17 


18 


Cellulose Synthase 


Contig composed of: 
wll.pk0009.c9 
wrl.pk0160.dll 
wreln.pk0043.f9 
wreln.pk0043.h8 
wreln.pk0131.gl0 


19 


20 


Cellulose Synthase 


wlln.pk0044.bl 


21 


22 



15 The Sequence Listing contains the one letter code for nucleotide sequence characters 

and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB 
standards described in Nucleic Acids Research 73:3021-3030 (1985) and in the Biochemical 
Journal 219 (No. 2;:345-373 (1984) which are herein incorporated by reference. The 
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symbols and format used for nucleotide and amino acid sequence data comply with the rules 
set forth in 37 CF.R. §1.822. 

DETAILED DESCRIPTION OF THE INVENTION 
In the context of this disclosure, a number of terms shall be utilized. As used herein, a 
5 "nucleic acid fragment" is a polymer of RN A or DN A that is single- or double-stranded, 
optionally containing synthetic, non-natural or altered nucleotide bases. A nucleic acid 
fragment in the form of a polymer of DNA may be comprised of one or more segments of 
cDNA, genomic DNA or synthetic DNA. 

As used herein, "contig" refers to a nucleotide sequence that is assembled from two or 

10 more constituent nucleotide sequences that share common or overlapping regions of 
sequence homology. For example, the nucleotide sequences of two or more nucleic acid 
fragments can be compared and aligned in order to identify common or overlapping 
sequences. Where common or overlapping sequences exist between two or more nucleic 
acid fragments, the sequences (and thus their corresponding nucleic acid fragments) can be 

15 assembled into a single contiguous nucleotide sequence. 

As used herein, "substantially similar" refers to nucleic acid fragments wherein 
changes in one or more nucleotide bases results in substitution of one or more amino acids, 
but do not affect the functional properties of the polypeptide encoded by the nucleotide 
sequence. "Substantially similar" also refers to nucleic acid fragments wherein changes in 

20 one or more nucleotide bases does not affect the ability of the nucleic acid fragment to 

mediate alteration of gene expression by gene silencing through for example antisense or co- 
suppression technology. "Substantially similar" also refers to modifications of the nucleic 
acid fragments of the instant invention such as deletion or insertion of one or more 
nucleotides that do not substantially affect the fimctional properties of the resulting 

25 transcript vis-a-vis the ability to mediate gene silencing or alteration of the functional 
properties of the resulting protein molecule. It is therefore understood that the invention 
encompasses more than the specific exemplary nucleotide or amino acid sequences and 
includes functional equivalents thereof. 

For example, it is well known in the art that antisense suppression and co-suppression 

30 of gene expression may be accomplished using nucleic acid fragments representing less than 
the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid 
fragment which result in the production of a chemically equivalent amino acid at a given 
site, but do not effect the functional properties of the encoded polypeptide, are well known in 

35 the art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be 

substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result 
in substitution of one negatively charged residue for another, such as aspartic acid for 

4 
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glutamic acid, or one positively charged residue for another, such as lysine for arginine, can 
also be expected to produce a functionally equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the polypeptide molecule 
would also not be expected to alter the activity of the polypeptide. Each of the proposed 
5 modifications is well within the routine skill in the art, as is determination of retention of 
biological activity of the encoded products. 

Moreover, substantially similar nucleic acid fragments may also be characterized by 
their ability to hybridize, under stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with the 
nucleic acid fragments disclosed herein. 

10 Substantially similar nucleic acid fragments of the instant invention may also be 

characterized by the percent identity of the amino acid sequences that they encode to the 
amino acid sequences disclosed herein, as determined by algorithms commonly employed by 
those skilled in this art. Preferred are those nucleic acid fragments whose nucleotide 
sequences encode amino acid sequences that are 80% identical to the amino acid sequences 

15 reported herein. More preferred nucleic acid fragments encode amino acid sequences that 
are 90% identical to the amino acid sequences reported herein. Most preferred are nucleic 
acid fragments that encode amino acid sequences that are 95% identical to the amino acid 
sequences reported herein. Sequence alignments and percent identity calculations were 
performed using the Megalign program of the LAS ARGENE bioinformatics computing suite 

20 (DNASTAR Inc., Madison, WI). Multiple alignment of the sequences was performed using 
the Clustal method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the 
default parameters (GAP PEN ALT Y= 10, GAP LENGTH PENALTY=10). Default 
parameters for pairwise alignments using the Clustal method were KTUPLE 1, GAP 
PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. 

25 A "substantial portion" of an amino acid or nucleotide sequence comprises an amino 

acid or a nucleotide sequence that is sufficient to afford putative identification of the protein 
or gene that the amino acid or nucleotide sequence comprises. Amino acid and nucleotide 
sequences can be evaluated either manually by one skilled in the art, or by using computer- 
based sequence comparison and identification tools that employ algorithms such as BLAST 

30 (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol. 275:403-410; see 
also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence often or more contiguous 
amino acids or thirty or more contiguous nucleotides is necessary in order to putatively 
identify a polypeptide or nucleic acid sequence as homologous to a known protein or gene. 
Moreover, with respect to nucleotide sequences, gene-specific oligonucleotide probes 

35 comprising 30 or more contiguous nucleotides may be used in sequence-dependent methods 
of gene identification (e.g., Southern hybridization) and isolation (e.g., in situ hybridization 
of bacterial colonies or bacteriophage plaques). In addition, short oligonucleotides of 12 or 
more nucleotides may be used as amplification primers in PCR in order to obtain a particular 
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nucleic acid fragment comprising the primers. Accordingly, a ''substantial portion" of a 
nucleotide sequence comprises a nucleotide sequence that will afford specific identification 
and/or isolation of a nucleic acid fragment comprising the sequence. The instant 
specification teaches amino acid and nucleotide sequences encoding polypeptides that 
5 comprise one or more particular plant proteins. The skilled artisan, having the benefit of the 
sequences as reported herein, may now use all or a substantial portion of the disclosed 
sequences for purposes known to those skilled in this art. Accordingly, the instant invention 
comprises the complete sequences as reported in the accompanying Sequence Listing, as 
well as substantial portions of those sequences as defined above. 

10 u Codon degeneracy" refers to divergence in the genetic code permitting variation of 

the nucleotide sequence without effecting the amino acid sequence of an encoded 
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment 
comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid 
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited 

15 by a specific host cell in usage of nucleotide codons to specify a given amino acid. 

Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, 
it is desirable to design the nucleic acid fragment such that its frequency of codon usage 
approaches the frequency of preferred codon usage of the host cell. 

"Synthetic nucleic acid fragments" can be assembled from oligonucleotide building 

20 blocks that are chemically synthesized using procedures known to those skilled in the art. 
These building blocks are ligated and annealed to form larger nucleic acid fragments which 
may then be enzymatically assembled to construct the entire desired nucleic acid fragment. 
"Chemically synthesized", as related to nucleic acid fragment, means that the component 
nucleotides were assembled in vitro. Manual chemical synthesis of nucleic acid fragments 

25 may be accomplished using well established procedures, or automated chemical synthesis 
can be performed using one of a number of commercially available machines. Accordingly, 
the nucleic acid fragments can be tailored for optimal gene expression based on optimization 
of nucleotide sequence to reflect the codon bias of the host cell. The skilled artisan 
appreciates the likelihood of successful gene expression if codon usage is biased towards 

30 those codons favored by the host. Determination of preferred codons can be based on a 
survey of genes derived from the host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 

35 own regulatory sequences. "Chimeric gene" refers any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 



WO 00/04166 



PCT/US99/15871 



from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A foreign" gene refers to a gene not normally found in the host organism, but 
that is introduced into the host organism by gene transfer. Foreign genes can comprise 
5 native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a nucleotide sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5* non- 
coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, 

10 and which influence the transcription, RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a nucleotide sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3' to a 

15 promoter sequence. The promoter sequence consists of proximal and more distal upstream 
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
nucleotide sequence which can stimulate promoter activity and may be an innate element of 
the promoter or a heterologous element inserted to enhance the level or tissue-specificity of 
a promoter. Promoters may be derived in their entirety from a native gene, or be composed 

20 of different elements derived from different promoters found in nature, or even comprise 
synthetic nucleotide segments. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental conditions. 
Promoters which cause a nucleic acid fragment to be expressed in most cell types at most 

25 times are commonly referred to as "constitutive promoters". New promoters of various 
types useful in plant cells are constantly being discovered; numerous examples may be 
found in the compilation by Okamuro and Goldberg (1989) Biochemistry of Plants 75:1-82. 
It is further recognized that since in most cases the exact boundaries of regulatory sequences 
have not been completely defined, nucleic acid fragments of different lengths may have 

30 identical promoter activity. 

The "translation leader sequence" refers to a nucleotide sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
present in the fully processed mRNA upstream of the translation start sequence. The 
translation leader sequence may affect processing of the primary transcript to mRNA, 

35 mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner and Foster (1995) Molecular Biotechnology J:225). 

The "3* non-coding sequences" refer to nucleotide sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 

7 
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encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is 
exemplified by Ingelbrecht et al. (1989) Plant Cell 7:671-680. 
5 "RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 

transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived from posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
10 without introns and that can be translated into polypeptide by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to an RNA transcript that includes the mRNA and so can be translated into a 
polypeptide by the cell. "Antisense RNA" refers to an RNA transcript that is 
complementary to all or part of a target primary transcript or mRNA and that blocks the 
15 expression of a target gene (see U.S. Patent No. 5, 1 07,065, incorporated herein by 

reference). The complementarity of an antisense RNA may be with any part of the specific 
nucleotide sequence, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or 
the coding sequence. "Functional RNA" refers to sense RNA, antisense RNA, ribozyme 
RNA, or other RNA that may not be translated but yet has an effect on cellular processes. 
20 The term "operably linked" refers to the association of two or more nucleic acid 

fragments on a single nucleic acid fragment so that the function of one is affected by the 
other. For example, a promoter is operably linked with a coding sequence when it is capable 
of affecting the expression of that coding sequence (i.e., that the coding sequence is under 
the transcriptional control of the promoter). Coding sequences can be operably linked to 
25 regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of 
30 suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production in normal or 
non-transformed organisms. "Co-suppression" refers to the production of sense RNA 
transcripts capable of suppressing the expression of identical or substantially similar foreign 
or endogenous genes (U.S. Patent No. 5,23 1 ,020, incorporated herein by reference). 
35 "Altered levels" refers to the production of gene product(s) in transgenic organisms in 

amounts or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 

8 
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"Precursor" protein refers to the primary product of translation of mRNA; i.e.. with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
5 conjunction with a protein and directs the protein to the chloroplast or other plastid types 
present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 
amino acid sequence which is translated in conjunction with a protein and directs the protein 
to the secretory system (Chrispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53). 
10 If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can further be 
added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra) 
may be added. If the protein is to be directed to the nucleus, any signal peptide present 
should be removed and instead a nuclear localization signal included (Raikhel (1992) Plant 
Phys. 700:1627-1632). 

15 "Transformation" refers to the transfer of a nucleic acid fragment into the genome of a 

host organism, resulting in genetically stable inheritance. Host organisms containing the 
transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al. (1987) Meth. Enzymol. 143:211) and particle-accelerated or "gene gun" transformation 

20 technology (Klein et al. (1987) Nature (London) 327:10-13; U.S. Patent No. 4,945,050, 
incorporated herein by reference). 

Standard recombinant DNA and molecular cloning techniques used herein are well 
known in the art and are described more fully in Sambrook et al. Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 

25 (hereinafter "Maniatis"). 

Nucleic acid fragments encoding at least a portion of a cellulose synthase enzyme have 
been isolated and identified by comparison of random plant cDNA sequences to public 
databases containing nucleotide and protein sequences using the BLAST algorithms well 
known to those skilled in the art. The nucleic acid fragments of the instant invention may be 

30 used to isolate cDNAs and genes encoding homologous proteins from the same or other 
plant species. Isolation of homologous genes using sequence-dependent protocols is well 
known in the art. Examples of sequence-dependent protocols include, but are not limited to, 
methods of nucleic acid hybridization, and methods of DNA and RNA amplification as 
exemplified by various uses of nucleic acid amplification technologies (e.g., polymerase 

35 chain reaction, ligase chain reaction). 

For example, genes encoding other cellulose synthase enzymes, either as cDNAs or 
genomic DNAs, could be isolated directly by using all or a portion of the instant nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant employing 

9 



WO 00/04166 



PCT/US99/15871 



methodology well known to those skilled in the art. Specific oligonucleotide probes based 
upon the instant nucleic acid sequences can be designed and synthesized by methods known 
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize 
DNA probes by methods known to the skilled artisan such as random primer DNA labeling, 
5 nick translation, or end-labeling techniques, or RNA probes using available in vitro 

transcription systems. In addition, specific primers can be designed and used to amplify a 
part or all of the instant sequences. The resulting amplification products can be labeled 
directly during amplification reactions or labeled after amplification reactions, and used as 
probes to isolate full length cDNA or genomic fragments under conditions of appropriate 
10 stringency. 

In addition, two short segments of the instant nucleic acid fragments may be used in 
polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding 
homologous genes from DNA or RNA. The polymerase chain reaction may also be 
performed on a library of cloned nucleic acid fragments wherein the sequence of one primer 

15 is derived from the instant nucleic acid fragments, and the sequence of the other primer takes 
advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA 
precursor encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning vector. For example, the skilled artisan can follow 
the RACE protocol (Frohman et al. (1988) Proc. Natl Acad Sci. USA 55:8998) to generate 

20 cDNAs by using PCR to amplify copies of the region between a single point in the 

transcript and the 3 1 or 5' end. Primers oriented in the 3' and 5' directions can be designed 
from the instant sequences. Using commercially available 3' RACE or 5 1 RACE systems 
(BRL), specific 3' or 5' cDNA fragments can be isolated (Ohara et al. (1989) Proc. Natl. 
Acad ScL USA 86:5673; Loh et al. (1989) Science 243:217). Products generated by the 3' 

25 and 5' RACE procedures can be combined to generate full-length cDNAs (Frohman and 
Martin (1989) Techniques 7:165). 

Availability of the instant nucleotide and deduced amino acid sequences facilitates 
immunological screening of cDNA expression libraries. Synthetic peptides representing 
portions of the instant amino acid sequences may be synthesized. These peptides can be 

30 used to immunize animals to produce polyclonal or monoclonal antibodies with specificity 
for peptides or proteins comprising the amino acid sequences. These antibodies can be then 
be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest 
(Lerner (19%4) Adv. Immunol. 36:1; Maniatis). 

The nucleic acid fragments of the instant invention may be used to create transgenic 

35 plants in which the disclosed polypeptides are present at higher or lower levels than normal 
or in cell types or developmental stages in which they are not normally found. This would 
have the effect of altering the level of cellulose synthase in those cells. 
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Overexpression of the proteins of the instant invention may be accomplished by first 
constructing a chimeric gene in which the coding region is operably linked to a promoter 
capable of directing expression of a gene in the desired tissues at the desired stage of 
development For reasons of convenience, the chimeric gene may comprise promoter 
sequences and translation leader sequences derived from the same genes. 3' Non-coding 
sequences encoding transcription termination signals may also be provided. The instant 
chimeric gene may also comprise one or more introns in order to facilitate gene expression. 

Plasmid vectors comprising the instant chimeric gene can then constructed. The 
choice of plasmid vector is dependent upon the method that will be used to transform host 
plants. The skilled artisan is well aware of the genetic elements that must be present on the 
plasmid vector in order to successfully transform, select and propagate host cells containing 
the chimeric gene. The skilled artisan will also recognize that different independent 
transformation events will result in different levels and patterns of expression (Jones et al. 
(m5)EMBOJ. 4:2411-2418; De Almeida etal. (1989) Mol Gen. Genetics 275:78-86), and 
thus that multiple events must be screened in order to obtain lines displaying the desired 
expression level and pattern. Such screening may be accomplished by Southern analysis of 
DNA, Northern analysis of mRNA expression, Western analysis of protein expression, or 
phenotypic analysis. 

For some applications it may be useful to direct the instant polypeptides to different 
cellular compartments, or to facilitate its secretion from the cell. It is thus envisioned that 
the chimeric gene described above may be further supplemented by altering the coding 
sequence to encode the instant polypeptides with appropriate intracellular targeting 
sequences such as transit sequences (Keegstra (1989) Cell 56:247-253), signal sequences or 
sequences encoding endoplasmic reticulum localization (Chrispeels (1991) Ann. Rev. Plant 
Phys. Plant Mol. Biol -/2:21-53), or nuclear localization signals (Raikhel (1992) Plant 
Phys.100: 1627-1632) added and/or with targeting sequences that are already present 
removed. While the references cited give examples of each of these, the list is not 
exhaustive and more targeting signals of utility may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of genes encoding the 
instant polypeptides in plants for some applications. In order to accomplish this, a chimeric 
gene designed for co-suppression of the instant polypeptide can be constructed by linking a 
gene or gene fragment encoding that polypeptide to plant promoter sequences. 
Alternatively, a chimeric gene designed to express antisense RNA for all or part of the 
instant nucleic acid fragment can be constructed by linking the gene or gene fragment in 
reverse orientation to plant promoter sequences. Either the co-suppression or antisense 
chimeric genes could be introduced into plants via transformation wherein expression of the 
corresponding endogenous genes are reduced or eliminated. 
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Molecular genetic solutions to the generation of plants with altered gene expression 
have a decided advantage over more traditional plant breeding approaches. Changes in plant 
phenotypes can be produced by specifically inhibiting expression of one or more genes by 
antisense inhibition or cosuppression (U. S. Patent Nos. 5,190,931, 5,107,065 and 
5 5,283,323). An antisense or cosuppression construct would act as a dominant negative 
regulator of gene activity. While conventional mutations can yield negative regulation of 
gene activity these effects are most likely recessive. The dominant negative regulation 
available with a transgenic approach may be advantageous from a breeding perspective. In 
addition, the ability to restrict the expression of specific phenotype to the reproductive 

10 tissues of the plant by the use of tissue specific promoters may confer agronomic 

advantages relative to conventional mutations which may have an effect in all tissues in 
which a mutant gene is ordinarily expressed. 

The person skilled in the art will know that special considerations are associated with 
the use of antisense or cosuppresion technologies in order to reduce expression of particular 

15 genes. For example, the proper level of expression of sense or antisense genes may require 
the use of different chimeric genes utilizing different regulatory elements known to the 
skilled artisan. Once transgenic plants are obtained by one of the methods described above, 
it will be necessary to screen individual transgenics for those that most effectively display 
the desired phenotype. Accordingly, the skilled artisan will develop methods for screening 

20 large numbers of transformants. The nature of these screens will generally be chosen on 
practical grounds, and is not an inherent part of the invention. For example ; one can screen 
by looking for changes in gene expression by using antibodies specific for the protein 
encoded by the gene being suppressed, or one could establish assays that specifically 
measure enzyme activity. A preferred method will be one which allows large numbers of 

25 samples to be processed rapidly, since it will be expected that a large number of 
transformants will be negative for the desired phenotype. 

The instant polypeptides (or portions thereof) may be produced in heterologous host 
cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies to the 
these proteins by methods well known to those skilled in the art. The antibodies are useful 

30 for detecting the polypeptides of the instant invention in situ in cells or in vitro in cell 
extracts. Preferred heterologous host cells for production of the instant polypeptides are 
microbial hosts. Microbial expression systems and expression vectors containing regulatory 
sequences that direct high level expression of foreign proteins are well known to those 
skilled in the art. Any of these could be used to construct a chimeric gene for production of 

35 the instant polypeptides. This chimeric gene could then be introduced into appropriate 

microorganisms via transformation to provide high level expression of the encoded cellulose 
synthase. An example of a vector for high level expression of the instant polypeptides in a 
bacterial host is provided (Example 6). 
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Additionally, the instant polypeptides can be used as a targets to facilitate design 
and/or identification of inhibitors of those enzymes that may be useful as herbicides. This is 
desirable because the polypeptides described herein catalyze a step in the synthesis of 
cellulose. Accordingly, inhibition of the activity of one or more of the enzymes described 
herein could lead to inhibition plant growth. Thus, the instant polypeptides could be 
appropriate for new herbicide discovery and design. 

All or a substantial portion of the nucleic acid fragments of the instant invention may 
also be used as probes for genetically and physically mapping the genes that they are a part 
of, and as markers for traits linked to those genes. Such information may be useful in plant 
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic 
acid fragments may be used as restriction fragment length polymorphism (RFLP) markers. 
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with 
the nucleic acid fragments of the instant invention. The resulting banding patterns may then 
be subjected to genetic analyses using computer programs such as MapMaker (Lander et al. 
(1987) Genomics 1 : 1 74-1 8 1 ) in order to construct a genetic map. In addition, the nucleic 
acid fragments of the instant invention may be used to probe Southern blots containing 
restriction endonuclease-treated genomic DNAs of a set of individuals representing parent 
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted 
and used to calculate the position of the instant nucleic acid sequence in the genetic map 
previously obtained using this population (Botstein et al. (1980) Am, J. Hum, Genet 
52:314-331). 

The production and use of plant gene-derived probes for use in genetic mapping is 
described in Bernatzky and Tanksley (1986) Plant Mol, Biol Reporter 4(l):Zl-4\. 
Numerous publications describe genetic mapping of specific cDNA clones using the 
methodology outlined above or variations thereof. For example, F2 intercross populations, 
backcross populations, randomly mated populations, near isogenic lines, and other sets of 
individuals may be used for mapping. Such methodologies are well known to those skilled 
in the art. 

Nucleic acid probes derived from the instant nucleic acid sequences may also be used 
for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel et al. In: 
Nonmammalian Genomic Analysis: A Practical Guide, Academic press 1996, pp. 319-346, 
and references cited therein). 

In another embodiment, nucleic acid probes derived from the instant nucleic acid 
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask 
(1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor use of 
large clones (several to several hundred KB; see Laan et al. (1995) Genome Research 
5:13-20), improvements in sensitivity may allow performance of FISH mapping using 
shorter probes. 
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A variety of nucleic acid amplification-based methods of genetic and physical 
mapping may be carried out using the instant nucleic acid sequences. Examples include 
allele-specific amplification (Kazazian (1989) J. Lab. Clin. Med 114(2):95-96), 
polymorphism of PCR-amplified fragments (CAPS; Sheffield et al. (1993) Genomics 
5 75:325-332), allele-specific ligation (Landegren et al. (1988) Science 247:1077-1080), 
nucleotide extension reactions (Sokolov (1990) Nucleic Acid Res. 75:3671), Radiation 
Hybrid Mapping (Walter et al. (1997) Nature Genetics 7:22-28) and Happy Mapping (Dear 
and Cook (1989) Nucleic Acid Res. 1 7:6795-6807). For these methods, the sequence of a 
nucleic acid fragment is used to design and produce primer pairs for use in the amplification 

10 reaction or in primer extension reactions. The design of such primers is well known to those 
skilled in the art. In methods employing PCR-based genetic mapping, it may be necessary 
to identify DNA sequence differences between the parents of the mapping cross in the 
region corresponding to the instant nucleic acid sequence. This, however, is generally not 
necessary for mapping methods. 

15 Loss of function mutant phenotypes may be identified for the instant cDNA clones 

either by targeted gene disruption protocols or by identifying specific mutants for these 
genes contained in a maize population carrying mutations in all possible genes (Ballinger 
and Benzer (1989) Proc. Natl Acad. Sci USA 55:9402; Koes et al. (1995) Proc. Natl Acad 
Sci USA 92:8149; Bensen et al. (1995) Plant Cell 7:75). The latter approach may be 

20 accomplished in two ways. First, short segments of the instant nucleic acid fragments may 
be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence 
primer on DNAs prepared from a population of plants in which Mutator transposons or some 
other mutation-causing DNA element has been introduced (see Bensen, supra). The 
amplification of a specific DNA fragment with these primers indicates the insertion of the 

25 mutation tag element in or near the plant gene encoding the instant polypeptides. 

Alternatively, the instant nucleic acid fragment may be used as a hybridization probe against 
PCR amplification products generated from the mutation population using the mutation tag 
sequence primer in conjunction with an arbitrary genomic site primer, such as that for a 
restriction enzyme site-anchored synthetic adaptor. With either method, a plant containing a 

30 mutation in the endogenous gene encoding the instant polypeptides can be identified and 

obtained. This mutant plant can then be used to determine or confirm the natural function of 
the instant polypeptides disclosed herein. 

EXAMPLES 

The present invention is further defined in the following Examples, in which all parts 
35 and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 
understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
skilled in the art can ascertain the essential characteristics of this invention, and without 
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departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usages and conditions. 

EXAMPLE 1 

Composition of cDNA Libraries: Isol ation and Sequencing of cDNA Clones 
5 cDNA libraries representing mRNAs from various barley, corn, rice, soybean and 

wheat tissues were prepared. The characteristics of the libraries are described below. 

TABLE 2 

cDNA Libraries from Barley, Corn, Rice, Soybean and Wheat 



Library 



Tissue 



Clone 



bshl 
ccoln 

cdt2c 

crln 
csclc 

p0031 

pOUO 

p0097 
p0122 
rlr24 

sdp2c 
ses8w 

ssl 
wll 

wlln 

wrl 



Barley (Hordeum vulgare) sheath, developing seedling 

Corn (Zea mays) cob of 67 day old plants grown in green 
house* 

Corn (Zea mays) developing tassel 2 



Corn (Zea mays) root from 7 day seedlings grown in light* 

Corn (Zea mays) 20 day seedling (germination under cold 
stress) 

Corn (Zea mays) shoot culture, initiated from seed derived 
meristems culture was maintained on 273N medium. 
Corn (Zea mays) stages V3 /V4** leaf tissue minus midrib 
harvested 4 hours, 24 hours and 7 days after infiltration 
with salicylic acid, tissues pooled* 

Corn (Zea mays) stage V9** whorl section (7 cm) from 
plant infected four times with european corn borer 
Corn (Zea mays) pith tissue collected from internode 
subtending ear node 5 days after pollenation 
Rice (Oryza sativa) leaf (15 days after germination) 
24 hours after infection of Magaporthe grisea strain 
4360-R-62 (AVR2-YAMO); Resistant 

Soybean (Glycine max) developing pods 6-7 mm 

Soybean (Glycine max) mature embryo 8 weeks after 
subculture 

Soybean (Glycine max) seedling 5-10 day 

Wheat (Triticum aestivum) leaf 7 day old seedling, light 
grown 

Wheat (Triticum aestivum) leaf 7 day old seedling, light 
grown* 

Wheat (Triticum aestivum) root; 7 day old seedling, light 
grown 



bshl.pk0002.f6 
ccoln.pk0005.g3 

cdt2c.pk002.gl 
cdt2c.pk002.116 
crln.pk0135.el0 
csclc.pk002.il 

p0031.ccmar05rb 

p0110.cgsma57r 

p0097.cqradl7rc 

p0122.ckamh70rc 

rlr24.pk0073.gl 

sdp2c.pk005.o22 
ses8w.pkO028.f3 

ssl.pk0036.cl0 
wll.pk0009.c9 

wlln.pk0044.bl 

wrl.pk0160.dll 
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Library Tissue 




wreln Wheat (Triticum aestivum) root: 7 day old etiolated 


wrein.pKuu4j.rv 


seedling* 




wreln.pk0043.h8 




wreln.pk0131.gl0 



♦These libraries were normalized essentially as described in U.S. Patent No. 5,482,845, 
incorporated herein by reference. 

**V3, V4 and V9 refer to stages of corn growth. The descriptions can be found in "How 
5 a Corn Plant Develops" Special Report No. 48, Iowa State University of Science and 
Technology Cooperative Extension Service Ames, Iowa, Reprinted February 1993. 



cDNA libraries may be prepared by any one of many methods available. For 
example, the cDNAs may be introduced into piasmid vectors by first preparing the cDNA 

10 libraries in Uni-ZAP™ XR vectors according to the manufacturer's protocol (Stratagene 
Cloning Systems, La Jolla, CA). The Uni-ZAP™ XR libraries are converted into piasmid 
libraries according to the protocol provided by Stratagene. Upon conversion, cDNA inserts 
will be contained in the piasmid vector pBluescript. In addition, the cDNAs may be 
introduced directly into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA 

15 ligase (New England Biolabs), followed by transfection into DH10B cells according to the 
manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts are in piasmid 
vectors, piasmid DNAs are prepared from randomly picked bacterial colonies containing 
recombinant pBluescript plasmids, or the insert cDNA sequences are amplified via 
polymerase chain reaction using primers specific for vector sequences flanking the inserted 

20 cDNA sequences. Amplified insert DNAs or piasmid DNAs are sequenced in dye-primer 
sequencing reactions to generate partial cDNA sequences (expressed sequence tags or 
"ESTs"; see Adams et al., (1991) Science 252:1651). The resulting ESTs are analyzed using 
a Perkin Elmer Model 377 fluorescent sequencer. 

EXAMPLE 2 

25 Identification of cDNA Clones 

cDNA clones encoding cellulose synthase enzymes were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol 
275:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS 

30 translations, sequences derived from the 3 -dimensional structure Brookhaven Protein Data 
Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and 
DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed for similarity 
to all publicly available DNA sequences contained in the "nr" database using the BLASTN 
algorithm provided by the National Center for Biotechnology Information (NCBI). The 
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DNA sequences were translated in all reading frames and compared for similarity to all 
publicly available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish and States (1993) Nature Genetics 3:266-272) provided by the NCBI. For 
convenience, the P-value (probability) of observing a match of a cDNA sequence to a 
5 sequence contained in the searched databases merely by chance as calculated by BLAST are 
reported herein as "pLog" values, which represent the negative of the logarithm of the 
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the 
cDNA sequence and the BLAST "hit" represent homologous proteins. 

EXAMPLE 3 

10 Characterization of cDNA nones Encoding Cellulose Synthase 

The BLASTX search using the EST sequences from clones listed in Table 3 revealed 
similarity of the polypeptides encoded by the cDNAs to cellulose synthase from Arabidopsis 
thaliana (NCBI Identifier No. gi 2827139, gi 2827141, gi 4467125, gi 4886756 and gi 
313561 1) and Gossypium hirsutum (NCBI Identifier No. gi 1706958 and 5081779). Shown 

15 in Table 3 are the BLAST results for individual ESTs ("EST"), the sequences of the entire 
cDNA inserts comprising the indicated cDNA clones ("FIS"), complete gene sequences 
("CGS") or contigs assembled from two or more ESTs ("Contig"): 

TABLE 3 

20 BLAST Results for Sequences Encoding Polypeptides Homologous 

to Arabidopsis thaliana and Gossypium hirsutum Cellulose Synthase 



Clone Status BLAST pLog Score 



bshl.pk0002.f6 


FIS 


154.00 (gi 2827139) 


Contig composed of: 
ccoln.pk0005.g3 
cdt2c.pk002.gl 
cdt2c.pk002.116 
csclc.pk002.il 
p0031.ccmar05rb 
p0110.cgsma57r 


Contig 


>254.00 (gi 2827141) 


crln.pk0135.el0 


FIS 


176.00 (gi 1706958) 


p0097.cqradl7rc 


CGS 


>254.00(gi 2827141) 


p0122.ckamh70rc 


CGS 


>254.00(gi 2827141) 


rlr24.pk0073.gl 


EST 


77.70 (gi 4467125) 


sdp2c.pk005.o22 


FIS 


>254.00 (gi 4886756) 


ses8w.pk0028.f3 


EST 


>254.00 (gi 2827139) 


ssl.pk0036.cl0 


EST 


>254.00(gi 2827141) 


Contig composed of: 
wll.pk0009.c9 
wrl.pk0160.dll 
wreln.pk0043.f9 


Contig 


>254.00 (gi 5081779) 
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wreln.pk0043.h8 
wreln.pk0131.gl0 

wlln.pk0044.bl EST 166.00 (gi 3135611) 

Figure 1 presents an alignment of the amino acid sequences set forth in SEQ ID 
NOs:2, 4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana (SEQ ID NOs:23 
(gi 2827139), 24 (gi 2827141), 26 (gi 4467125), 27 (gi 4886756) and 29 (gi 313561 1) ) and 
Gossypium hirsutum (SEQ ID NOs:25 (gi 1 706958) and 28 (gi 5081779)) sequences. The 
data in Table 4 represents a calculation of the percent identity of the amino acid sequences 
set forth in SEQ ID NOs:2, 4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana 
(SEQ ID NOs:23, 24, 26, 27 and 29) and Gossypium hirsutum (SEQ ID NOs:25 and 28) 
sequences. 

TABLE 4 



Percent Identity of Amino Acid Sequences Deduced From the Nucleotide Sequences 

of cDNA Clones Encoding Polypeptides Homologous 
to Arabidopsis thaliana and Gossypium hirsutum Cellulose Synthase 



SEQ ID NO. 


Percent Identity to 


2 


82% (gi 2827139) 


4 


69% (gi 2827141) 


6 


89% (gi 1706958) 


8 


70% (gi 2827141) 


10 


70% (gi 2827141) 


12 


36% (gi 4467125) 


14 


86% (gi 4886756) 


16 


88% (gi 2827139) 


18 


86% (gi 2827141) 


20 


87% (gi 5081779) 


22 


70% (gi 3135611) 



Sequence alignments and percent identity calculations were performed using the 
Megalign program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., 
Madison, WI). Multiple alignment of the sequences was performed using the Clustal 
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default 
parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for 
pairwise alignments using the Clustal method were KTUPLE 1, GAP PENALTY=3, 
WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments and BLAST scores and 
probabilities indicate that the nucleic acid fragments comprising the instant cDNA clones 
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encode a substantial portion of a cellulose synthase. These sequences represent the first 
barley, corn, rice, soybean and wheat sequences encoding cellulose synthase. 

EXAMPLE 4 
Expression of Chimeric Genes in Monocot Cells 
5 A chimeric gene comprising a cDNA encoding the instant polypeptides in sense 

orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA 
fragment, and the 10 kD zein 3* end that is located 3' to the cDNA fragment, can be 
constructed. The cDNA fragment of this gene may be generated by polymerase chain 
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites 

10 (Ncol or Smal) can be incorporated into the oligonucleotides to provide proper orientation 
of the DNA fragment when inserted into the digested vector pML103 as described below. 
Amplification is then performed in a standard PCR. The amplified DNA is then digested 
with restriction enzymes Ncol and Smal and fractionated on an agarose gel. The appropriate 
band can be isolated from the gel and combined with a 4.9 kb Ncol-Smal fragment of the 

15 plasmid pML103. Plasmid pML103 has been deposited under the terms of the Budapest 
Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas, 
VA 201 10-2209), and bears accession number ATCC 97366. The DNA segment from 
pML103 contains a 1 .05 kb Sall-Ncol promoter fragment of the maize 27 kD zein gene and 
a 0.96 kb Smal-Sall fragment from the 3' end of the maize 10 kD zein gene in the vector 

20 pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 1 5°C overnight, 

essentially as described (Maniatis). The ligated DNA may then be used to transform £ coli 
XLl-Blue (Epicurian Coli XL-1 Blue™; Straiagene). Bacterial transformants can be 
screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence 
analysis using the dideoxy chain termination method (Sequenase™ DNA Sequencing Kit; 

25 U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene 
encoding, in the 5' to 3' direction, the maize 27 kD zein promoter, a cDNA fragment 
encoding the instant polypeptides, and the 10 kD zein 3' region. 

The chimeric gene described above can then be introduced into corn cells by the 
following procedure. Immature corn embryos can be dissected from developing caryopses 

30 derived from crosses of the inbred corn lines H99 and LH132. The embryos are isolated 10 
to 1 1 days after pollination when they are 1 .0 to 1 .5 mm long. The embryos are then placed 
with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al. 
(1975) Sci. Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses of cells with somatic 

35 proembryoids and embryoids borne on suspensor structures proliferates from the scutellum 
of these immature embryos. The embiyogenic callus isolated from the primary explant can 
be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks. 
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The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt, 
Germany) may be used in transformation experiments in order to provide for a selectable 
marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236) 
which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers 
5 resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat 
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus 
(Odell et al. (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene 
from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) may be used 

10 to transfer genes to the callus culture cells. According to this method, gold particles (1 nm 
in diameter) are coated with DNA using the following technique. Ten ng of plasmid DNAs 
are added to 50 nL of a suspension of gold particles (60 mg per mL). Calcium chloride 
(50 \iL of a 2.5 M solution) and spermidine free base (20 \xL of a 1.0 M solution) are added 
to the particles. The suspension is vortexed during the addition of these solutions. After 

15 10 minutes, the tubes are briefly centrifuged (5 sec at 1 5,000 rpm) and the supernatant 
removed. The particles are resuspended in 200 |iL of absolute ethanol, centrifuged again 
and the supernatant removed. The ethanol rinse is performed again and the particles 
resuspended in a final volume of 30 jiL of ethanol. An aliquot (5 fiL) of the DNA-coated 
gold particles can be placed in the center of a Kapton™ flying disc (Bio-Rad Labs). The 

20 particles are then accelerated into the corn tissue with a Biolistic™ PDS- 1 000/He (Bio-Rad 
Instruments, Hercules CA), using a helium pressure of 1000 psi, a gap distance of 0.5 cm 
and a flying distance of 1 .0 cm. 

For bombardment, the embryogenic tissue is placed on filter paper over agarose- 
solidified N6 medium. The tissue is arranged as a thin lawn and covered a circular area of 

25 about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of 
the PDS- 1 000/He approximately 8 cm from the stopping screen. The air in the chamber is 
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a 
helium shock wave using a rupture membrane that bursts when the He pressure in the shock 
tube reaches 1000 psi. 

30 Seven days after bombardment the tissue can be transferred to N6 medium that 

contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to 
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to 
fresh N6 medium containing gluphosinate. After 6 weeks, areas of about 1 cm in diameter 
of actively growing callus can be identified on some of the plates containing the glufosinate- 

35 supplemented medium. These calli may continue to grow when sub-cultured on the 
selective medium. 

Plants can be regenerated from the transgenic callus by first transferring clusters of 
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the 
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tissue can be transferred to regeneration medium (Fromm et al. (1990) Bio/Technology 
5:833-839). 

EXAMPLE 5 
Expression of Chim eric Genes in Dicot Cells 
5 A seed-specific expression cassette composed of the promoter and transcription 

terminator from the gene encoding the p subunit of the seed storage protein phaseolin from 
the bean Phaseolw vulgaris (Doyle et al. (1986) J. Biol. Chem. 261 :9228-9238) can be used 
for expression of the instant polypeptides in transformed soybean. The phaseolin cassette 
includes about 500 nucleotides upstream (5') from the translation initiation codon and about 
10 1650 nucleotides downstream (3') from the translation stop codon of phaseolin. Between the 
5' and 3' regions are the unique restriction endonuclease sites Nco I (which includes the ATG 
translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by 
Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain reaction 
15 (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be 
incorporated into the oligonucleotides to provide proper orientation of the DNA fragment 
when inserted into the expression vector. Amplification is then performed as described 
above, and the isolated fragment is inserted into a pUC18 vector carrying the seed 
expression cassette. 

Soybean embroys may then be transformed with the expression vector comprising 
sequences encoding the instant polypeptides. To induce somatic embryos, cotyledons, 
3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar 
A2872, can be cultured in the light or dark at 26°C on an appropriate agar medium for 
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and 
placed into a suitable liquid medium. After repeated selection for clusters of somatic 
embryos which multiplied as early, globular staged embryos, the suspensions are maintained 
as described below. 

Soybean embryogenic suspension cultures can maintained in 35 mL liquid media on a 
rotary shaker, 1 50 rpm, at 26°C with florescent lights on a 16:8 hour day/night schedule. 
Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the method of 
particle gun bombardment (Klein et al. (1987) Nature (London) 327:70, U.S. Patent 
No. 4,945,050). A DuPont Biolistic™ PDS1000/HE instrument (helium retrofit) can be used 
for these transformations. 

A selectable marker gene which can be used to facilitate soybean transformation is a 
chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. 
(1985) Nature 3/3:810-812), the hygromycin phosphotransferase gene from plasmid pJR225 
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(from £ coir, Gritz et al.(1983) Gene 25:179-188) and the 3 1 region of the nopaline synthase 
gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression 
cassette comprising the phaseolin 5* region, the fragment encoding the instant polypeptides 
and the phaseolin 3' region can be isolated as a restriction fragment. This fragment can then 
5 be inserted into a unique restriction site of the vector carrying the marker gene. 

To 50 nL of a 60 mg/mL 1 \xm gold particle suspension is added (in order): 5 nL 
DNA (1 \ig/\iL), 20 pd spermidine (0. 1 M), and 50 nL CaCl 2 (2.5 M). The particle 
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the 
supernatant removed. The DNA-coated particles are then washed once in 400 jiL 70% 

10 ethanol and resuspended in 40 of anhydrous ethanol. The DNA/particle suspension can 
be sonicated three times for one second each. Five nL of the DNA-coated gold particles are 
then loaded on each macro carrier disk. 

Approximately 300-400 mg of a two-week-old suspension culture is placed in an 
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette. 

1 5 For each transformation experiment, approximately 5-10 plates of tissue are normally 

bombarded. Membrane rupture pressure is set at 1 1 00 psi and the chamber is evacuated to a 
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the 
retaining screen and bombarded three times. Following bombardment, the tissue can be 
divided in half and placed back into liquid and cultured as described above. 

20 Five to seven days post bombardment, the liquid media may be exchanged with fresh 

media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL 
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post 
bombardment, green, transformed tissue may be observed growing from untransformed, 
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into 

25 individual flasks to generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line may be treated as an independent transformation event. These 
suspensions can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic embryos. 

EXAMPLE 6 

30 Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant polypeptides can be inserted into the T7 E. coli 
expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987) 
Gene 5(5:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter 
system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites in 

35 pET-3a at their original positions. An oligonucleotide adaptor containing EcoR I and 
Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with 
additional unique cloning sites for insertion of genes into the expression vector. Then, the 
Nde I site at the position of translation initiation was converted to an Nco I site using 
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oUgonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 
5'-CATATGG, was converted to 5'-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic 
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve 
GTG™ low melting agarose gel (FMC). Buffer and agarose contain 10 ug/ml ethidium 
bromide for visualization of the DNA fragment The fragment can then be purified from the 
agarose gel by digestion with GELase™ (Epicentre Technologies) according to the 
manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 uL of water. 
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be 
purified from the excess adapters using low melting agarose as described above. The vector 
pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized 
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can 
then be ligated at 16°C for 15 hours followed by transformation into DH5 electrocompetent 
cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 
100 ug/mL ampicillin. Transformants containing the gene encoding the instant polypeptides 
are then screened for the correct orientation with respect to the T7 promoter by restriction 
enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into K coli strain BL21(DE3) 
(Studier et al. (1986) J. Mol. Biol. 189:1 13-130). Cultures are grown in LB medium 
containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately 
1, IPTG (isopropylthio-p-galactoside, the inducer) can be added to a final concentration of 
0.4 mM and incubation can be continued for 3 h at 25°. Cells are then harvested by 
centrifugation and re-suspended in 50 uL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM 
DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can 
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the supernatant 
determined. One \ig of protein from the soluble fraction of the culture can be separated by 
SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating 
at the expected molecular weight. 

EXAMPLE 7 
Evaluating Comp ounds for Their Ability to Inhibit 
the Activity of Cellulose Synthase 
The polypeptides described herein may be produced using any number of methods 
known to those skilled in the art. Such methods include, but are not limited to, expression in 
bacteria as described in Example 6, or expression in eukaryotic cell culture, in planta, and 
using viral expression systems in suitably infected organisms or cell lines. The instant 
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polypeptides may be expressed either as mature forms of the proteins as observed in vivo or 
as fusion proteins by covalent attachment to a variety of enzymes, proteins or affinity tags. 
Common fusion protein partners include glutathione S-transferase ("GST'), thiorcdoxin 
("Trx"), maltose binding protein, and C- and/or N-terminal hexahistidine polypeptide 
5 ("(His) 6 "). The fusion proteins may be engineered with a protease recognition site at the 
fusion point so that fusion partners can be separated by protease digestion to yield intact 
mature enzyme. Examples of such proteases include thrombin, enterokinase and factor Xa. 
However, any protease can be used which specifically cleaves the peptide connecting the 
fusion protein and the enzyme. 

10 Purification of the instant polypeptides, if desired, may utilize any number of 

separation technologies familiar to those skilled in the art of protein purification. Examples 
of such methods include, but are not limited to, homogenization, filtration, centrifugation, 
heat denaturation, ammonium sulfate precipitation, desalting, pH precipitation, ion exchange 
chromatography, hydrophobic interaction chromatography and affinity chromatography, 

15 wherein the affinity ligand represents a substrate, substrate analog or inhibitor. When the 
instant polypeptides are expressed as fusion proteins, the purification protocol may include 
the use of an affinity resin which is specific for the fusion protein tag attached to the 
expressed enzyme or an affinity resin containing ligands which are specific for the enzyme. 
For example, the instant polypeptides may be expressed as a fusion protein coupled to the 

20 C-terminus of thioredoxin. In addition, a (His) 6 peptide may be engineered into the 

N-tenninus of the fused thioredoxin moiety to afford additional opportunities for affinity 
purification. Other suitable affinity resins could be synthesized by linking the appropriate 
ligands to any suitable resin such as Sepharose-4B. In an alternate embodiment, a 
thioredoxin fusion protein may be eluted using dithiothreitol; however, elution may be 

25 accomplished using other reagents which interact to displace the thioredoxin from the resin. 
These reagents include p-mercaptoethanol or other reduced thiol. The eluted fusion protein 
may be subjected to further purification by traditional means as stated above, if desired. 
Proteolytic cleavage of the thioredoxin fusion protein and the enzyme may be accomplished 
after the fusion protein is purified or while the protein is still bound to the ThioBond™ 

30 affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion protein, may be 
utilized in assays for the evaluation of compounds for their ability to inhibit enzymatic 
activation of the instant polypeptides disclosed herein. Assays may be conducted under well 
known experimental conditions which permit optimal enzymatic activity. For example, 

35 assays for cellulose synthase activity are presented in WO 98/1 8949 and WO 98/00549. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid fragment comprising at least 900 nucleotides, wherein 
the nucleic acid fragment encodes a cellulose synthase comprising a member selected from 
the group consisting of: 

(a) an isolated nucleic acid fragment encoding an amino acid sequence that 
is at least 90% identical to the amino acid sequence set forth in a 
member selected from the group consisting of SEQ ID NO:2, 6, 12, 14, 
16, 18, 20 and 22; 

(b) an isolated nucleic acid fragment that is complementary to (a). 

2. The isolated nucleic acid fragment of Claim 1 wherein nucleic acid fragment is 
a functional RNA. 

3. The isolated nucleic acid fragment of Claim 1 wherein the nucleotide sequence 
of the fragment comprises the sequence set forth in a member selected from the group 
consisting of SEQ ID NO: 1, 5, 11, 13, 15, 17, 19 and 21. 

4. A chimeric gene comprising the nucleic acid fragment of Claim 1 operably 
linked to suitable regulatory sequences. 

5. A transformed host cell comprising the chimeric gene of Claim 4. 

6. A cellulose synthase polypeptide comprising all or a substantial portion of the 
amino acid sequence set forth in a member selected from the group consisting of SEQ ID 
NO:2, 6, 1 2, 1 4, 1 6, 1 8, 20 and 22. 

7. An isolated nucleic acid fragment encoding a cellulose synthase comprising a 
member selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding an amino acid sequence that 
is functionally acitve polypeptide and at least 80% identical to the 
amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:4, 8 and 10 ; 

(b) an isolated nucleic acid fragment that is complementary to (a). 

8. The isolated nucleic acid fragment of Claim 7 wherein nucleic acid fragment is 
a functional RNA. 

9. The isolated nucleic acid fragment of Claim 7 wherein the nucleotide sequence 
of the fragment comprises the sequence set forth in a member selected from the group 
consisting of SEQ ID NO:3, 7 and 9. 

1 0. A chimeric gene comprising the nucleic acid fragment of Claim 7 operably 
linked to suitable regulatory sequences. 

11. A transformed host cell comprising the chimeric gene of Claim 1 0. 



25 



WO 00/04166 



PCT/US99/15871 



1 2. A cellulose synthase polypeptide comprising all or a substantial portion of the 
amino acid sequence set forth in a member selected from the group consisting of SEQ ID 
NO:4,8,10. 

13. A method of altering the level of expression of a cellulose synthase in a host 
cell comprising: 

(a) transforming a host cell with the chimeric gene of any of Claims 4 and 
10; and 

(b) growing the transformed host cell produced in step (a) under conditions 
that are suitable for expression of the chimeric gene 

wherein expression of the chimeric gene results in production of altered levels of a cellulose 
synthase in the transformed host cell. 

14. A method of obtaining a nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence encoding a cellulose synthase comprising: 

(a) probing a cDNA or genomic library with the nucleic acid fragment of 
any of Claims 1 and 7; 

(b) identifying a DNA clone that hybridizes with the nucleic acid fragment 
any of of Claims land 7; 

(c) isolating the DNA clone identified in step (b); and 

(d) sequencing the cDNA or genomic fragment that comprises the clone 
isolated in step (c) 

wherein the sequenced nucleic acid fragment encodes all or a substantial portion of the 
amino acid sequence encoding a cellulose synthase. 

15. A method of obtaining a nucleic acid fragment encoding a substantial portion 
of an amino acid sequence encoding a cellulose synthase comprising: 

(a) synthesizing an oligonucleotide primer corresponding to a portion of 
the sequence set forth in any of SEQ ID NOs:l, 3, 5, 7, 9, 11, 13, 15, 
17, 19 and 21; and 

(b) amplifying a cDNA insert present in a cloning vector using the 
oligonucleotide primer of step (a) and a primer representing sequences 
of the cloning vector 

wherein the amplified nucleic acid fragment encodes a substantial portion of an amino acid 
sequence encoding a cellulose synthase. 

1 6. The product of the method of Claim 1 4. 

1 7. The product of the method of Claim 15. 

1 8. A method for evaluating at least one compound for its ability to inhibit the 
activity of a cellulose synthase, the method comprising the steps of: 
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(a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding a cellulose synthase, operably linked to suitable 
regulatory sequences; 

(b) growing the transformed host cell under conditions that are suitable for 
expression of the chimeric gene wherein expression of the chimeric 
gene results in production of the cellulose synthase encoded by the 
operably linked nucleic acid fragment in the transformed host cell; 

(c) optionally purifying the cellulose synthase expressed by the 
transformed host cell; 

(d) treating the cellulose synthase with a compound to be tested; and 

(e) comparing the activity of the cellulose synthase that has been treated 
with a test compound to the activity of an untreated cellulose synthase, 

thereby selecting compounds with potential for inhibitory activity. 
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SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 
SEQ ID NO: 8 
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SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO: 26 
SEQ ID NO: 27 
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SEQ ID NO: 29 
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SEQ ID NO: 12 
SEQ ID NO: 14 
SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO:26 
SEQ ID NO: 27 
SEQ ID NO:28 
SEQ ID NO: 29 



SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 
SEQ ID NO: 8 
SEQ ID NO: 10 
SEQ ID NO: 12 
SEQ ID NO: 14 
SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO: 26 



Figure 1 
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CGDDVGRNPDGEPFVACNECAFPICRDCYEYERREGTQNCPQCKTRFKRLKGCARVPGD- 

CGDEVGVGFDGEPFVACNECAFPVCRACYEYERREGSQACPQCRTRYKRLKGCPRVAGD- 
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cpy 
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Figure 1 (cont'd.) 

SEQ ID NO: 27 CGDQIGLTVEGDLFVACNECGFPACRPCYEYERREGTQNCPQCKTRYKRLRGSPRVEGDE 

SEQ ID NO: 28 

SEQ ID NO: 29 CRDEIELTVDGEPFVACNECAFPVCRPCYEYERREGNQACPQCKTRFKRLKGSPRVEGD- 

181 240 

SEQ ID NO: 2 

SEQ ID NO: 4 EEEDGVDDLENEFNWSDK H DSQ YLAESMLHAHMS YG- RGADLDGVPQP FH PI PNVP 

SEQ ID NO: 6 _ 

SEQ ID NO: 8 EEEDGVDDLEGEFGLQDGAAHEDDPQYVAESMLRAQMSYG-RGGDAH PGFSPVPNVP 

SEQ ID NO: 10 EEEDGVDDLDNEFNW-DG HDSQSVAESMLYGHMSYG-RGGDPNGAPQAFQLNPNVP 

SEQ ID NO: 12 

SEQ ID NO: 14 DEEDV-DDIEHEFNIDEQKNKHGQ VAEAMLHGRMSYG — RGPEDDDNSQFPTPVIAG 

SEQ ID NO: 16 

SEQ ID NO: 18 " 

SEQ ID NO: 20 

SEQ ID NO: 22 p 

SEQ ID NO: 23 DEDDV-DDIENEFNYAQGANKARH QRHGE EFSSS — SRHESQPIPLLTHGHTVS 

SEQ ID NO: 24 EEEEDI DDLEYEFD HGMDPEHAAEAALS S RLNTG — RGGLDS AP PG SOIP 

SEQ ID NO: 25 _„ 

SEQ ID NO: 26 IGDLDDD TPDYSSGALPLPAPG-"-"" 

SEQ ID NO: 27 DEEDI-DDIEYEFNIEHEQDKHKH SAEAMLYGKMSYG — RGPEDDENGRFP-PVIAG 

SEQ ID NO: 28 

SEQ ID NO: 29 EEEDDI DDLDNEFEYGN NGIGFDQVSEGMSISRRNSGFPQSDLDSAPPG—SQIP 

241 300 

SEQ ID NO: 2 — . 

SEQ ID NO: 4 LLTNGQMVDDI PPDQHALVPS FV GGGGKRIHPLPYADPNLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 6 

SEQ ID NO: 8 LLTNGQMVDDI PPEQHALVPSYMSGGGGGGKRIHPLPFADPNLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 10 LLTNGQMVDDI PPEQHALVPSFM GGGGKRIHPLPYADPSLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 12 

SEQ ID NO: 14 GRSR PVSGEFPISSNAYGDQMLSSSLHKRVHPYPVSEPGSARW DEKKXDG 

SEQ ID NO: 16 

SEQ ID NO: 18 

SEQ ID NO: 20 

SEQ ID NO: 22 LLTNGQMVDDIPPEQHALVPSYMSGGGGGGKRIHPLPFADPNLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 23 GEIRTPDTQSVRTTSGPLGPSDRNAISSPYIDPR-QPVPVRIVDPSK DLNSYG 

SEQ ID NO: 24 LLTYCDEDADMYSDRHALIVP — PS-TGYGNRVYPAPFTDSSAPPQARSMVPQKDIAEYG 

SEQ ID NO: 25 

SEQ ID NO: 26 KDQRGNNNNMSMMKRNQNGEFDHNRWLF ETQGTYG 

SEQ ID NO: 27 GHS GEFPVGG-GYGNG — EHGLHKRVHPYPSSEAGS EGG 

SEQ ID NO: 28 

SEQ ID NO: 29 LLTYGDEDVEISSDRHALIVP — PSLGGHGNRVHPVSLSDPTVAAHRRLMVPQKDLAVYG 

301 360 

SEQ ID NO: 2 

SEQ ID NO: 4 YGSVAWKERMESWKQKQ-ERMHQTRNDGGGD DG DDADLPLM- DEARQ PLS R 

SEQ ID NO: 6 

SEQ ID NO: 8 YGSVAWKERMEGWKQKQ-ERLQHVRSEGGGDW DGDDADLPLM- DEARQ PLS R 

SEQ ID NO: 10 YGSVAWKERMENWKQRQ-ERMHQTGNDGGGD DG DDADL PLM- DEARQQLS R 

SEQ ID NO: 12 „_„ 

SEQ ID NO: 14 WKDRMDDWKLQQG NLGPEPDEDPDAAML-DEARQPLSR 

SEQ ID NO: 16 

SEQ ID NO: 18 HE 

SEQ ID NO: 20 

SEQ ID NO: 22 YGSVAWKERMEGWKQKQ-ERLQHVRSEGGGDW DG DDADLPLM- DEARQPLSR 

SEQ ID NO: 23 LGNVDWKERVEGWKLKQEKNMLQMTGKYHEGKGG-EIEGTGSNGEELQM-ADDTRLPMSR 
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Figure 1 (cont'd.) 

SEQ ID NO: 24 YGSVAWKDRMEVWKRRQGEKLQVIKHEGGNNGRGSN-DDDELDDPDMPMM-DEGRQPLSR 

SEQ ID NO: 25 

SEQ ID NO: 26 YGNAYWP QDEMYGD DMDEGMRGGMVETADKPWRPLSR 

SEQ ID NO: 27 WRERMDDWKLQHG NLGPEPDDDPEMGLI-DEARQPLSR 

SEQ ID NO: 28 

SEQ ID NO: 29 YGSVAWKDRMEEWKRKQNEKLQWRHEGDP DFEDGDDADFPMM-DEGRQPLSM 

361 420 
SEQ ID NO: 2 

SEQ ID NO: 4 KIPLPSSQINPYRMIIIIRLWLCFFFHYRVMHPVPDAFALWLISVICEIWFAMSWILDQ 

SEQ ID NO: 6 

SEQ ID NO: 8 KVPISSSRINPYRMIIVIRLWLGFFFHYRVMHPAKDAFALWLISVICEIWFAMSWILDQ 

SEQ ID NO: 10 KIPLPSSQINPYRMIIIIRLVVLGFFFHYRVMHPVNDAFALWLISVICEIWFAMSWILDQ 

SEQ ID NO: 12 • 

SEQ ID NO: 14 KVPIASSKINPYRMVIVARLVILAFFLRYRLMNPVHDALGLWLTSIICEIWFAFSWILDQ 

SEQ ID NO: 16 

SEQ ID NO: 18 LHPVNDAYGLWLTSVICEIWFAVSWIMDQ 

SEQ ID NO: 20 . 

SEQ ID NO: 22 KVPISSSRINPYRMIIVIRLWLGFFFHYRVMHPAKDAFALWLISVICEIWFAMSCILDQ 

SEQ ID NO: 23 WPI PSSRLTPYRWI ILRLI ILCFFLQYRTTHPVKNAYPLWLTSVICEIWFAFSWLLDQ 

SEQ ID NO: 24 KLPIRSSRINPYRMLILCRLAILGLFFHYRILHPVNDAYGLWLTSVICEIWFAVSWILDQ 

SEQ ID NO: 25 . 

SEQ ID NO: 26 RIPIPAAIISPYRLLIVIRFWLCFFLTWRIRNPNEDAIWLWLMSIICELWFGFSWILDQ 

SEQ ID NO: 27 KVPIASSKINPYRMVIVARLVILAVFLRYRLLNPVHDALGLWLTSVICEIWFAVSWILDQ 

SEQ ID NO: 28 _ 

SEQ ID NO: 29 KIPIKSSKINPYRMLIVLRLVILGLFFHYRILHPVKDAYALWLISVICEIWFAVSWVLDQ 

421 480 
SEQ ID NO: 2 

SEQ ID NO: 4 FPKWFPIERETYLDRLSLRFDKEGHPS QLAPVDFFVSTVDPLKEPPLVTANTVLS 

SEQ ID NO: 6 

SEQ ID NO: 8 FPKWLPIERETYLDRLSLRFDKEGQPS QLAPIDFFVSTVDPTKEPPLVTANTVLS 

SEQ ID NO: 10 FPKWFPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPLKEPPLVTTNTVLS 

SEQ ID NO: 12 

SEQ ID NO: 14 FPKWFPIDRETYLDRLSIRYEREGEPN MLAPVDVFVSTVDPMKEPPLVTANTVLS 

SEQ ID NO: 16 

SEQ ID NO: 18 FPKWYPIQRETYLDRLSLRYEKEGKPS ELSSVDVFVSTVDPMKEPPLITANTVLS 

SEQ ID NO: 20 

SEQ ID NO: 22 FPKWFPIERETYLDRLSLRFDKEGQPS QLAPIDFFVSTVDPTKEPPLVTANTVLS 

SEQ ID NO: 23 FPKWYPINRETYLDRLAIRYDRDGEPS QLVPVDVFVSTVDPLKEPPLVTANTVLS 

SEQ ID NO: 24 FPKWYPIERETYLDRLSLRYEKEGKPS GLAPVDVFVSTVDPLKEPPLITANTVLS 

SEQ ID NO: 25 

SEQ ID NO: 26 IPKLCPINRSTDLEVLRDKFDMPSPSNPTGRSDLPGIDLFVSTADPEKEPPLVTANTILS 

SEQ ID NO: 27 FPKWFPIERETYLDRLSLRYEREGEPN MLAPVDVFVSTVDPLKEPPLVTSNTVLS 

SEQ ID NO: 28 

SEQ ID NO: 29 FPKWYPIERETYLDRLSLRYEKEGKPS GLS PVDVFVSTVDPLKEPPLI TANTVLS 

481 540 
SEQ ID NO: 2 

SEQ ID NO: 4 ILSVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYSLEPRAPEWYFQQ — 

SEQ ID NO: 6 H 

SEQ ID NO: 8 ILSVDYPVEKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFSKKFNIEPRAPEWYFQQ — 

SEQ ID NO: 10 ILSVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYNIEPRAPEWYFQQ— 

SEQ ID NO: 12 ILAAGYPAGKVTCYIS DDAGAEVTRNAWEAARFAALWVS FCRKHGVEPRNLEAYFNAGE 

SEQ ID NO: 14 ILAMDYPVDKISCYISDDGASMCTFESLSETAEFARKWVPFCKKFSIEPRAPEMYFSE — 

SEQ ID NO: 16 

SEQ ID NO: 18 ILAVDYPVDKVACYVSDDGAAMLTFEALSETSEFARRWVPFCKKYNIEPRAPEWYFGQ — 
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Figure 1 (cont'd.) 

SEQ ID NO: 20 

SEQ ID NO: 22 ILSVDYPVEKVSCWSDDGAAMLTFEALSETSEFAKKWVPFSKKFNIEPRAPEWYFQQ-- 

SEQ ID NO: 23 ILSVDYPVDKVACYVSDDGSAMLTFESLSETAEFAKKWVPFCKKFNIEPRAPEFYFAQ— 

SEQ ID NO: 24 IIAVDYPVDKVACYVSDDGAAMLTFEALSDTAEFARKWVPFCKKFNIEPRAPEWYFSQ-- 

11% ™ RRWVPFCKKHNVEPRAPEFYFNE— 

SEQ ID NO: 26 ILAVDYPVEKVSCYLSDDGGALLSFEAMAEAASFADLWVPFCRKHNIEPRNPDSYFSL-- 

SEQ ID NO: 27 ILAMDYPVEKISCYVSDDGASMLTFESLSETAEFARKWVPFCKKFSIEPRAPEMYFTL— 

SEQ ID NO: 28 DYPVEKVSCYVSDDGAAMLTFEALSETSEFARKWVPFCKKYNIEPRAPEWYFAQ— 

SEQ ID NO: 29 ILAVDYPVDKVACYVSDDGAAMLTFEALSETAEFARKWVPFCECKYCIEPRAPEWYFCH— 

541 600 
SEQ ID NO: 2 „_ 

SEQ ID NO: 4 KIDYLKDKVAPNFVRERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 6 ; 

SEQ ID NO: 8 KI DYLKDKVAASFV RE RRAMKRE YEEFKVRINALVAKAQ 

SEQ ID NO: 10 KI DYLKDKVAANFVRERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 12 GGGGKAKWARGS Y- RGMAWPELVRDRRRVRRE YEEMRLRI DALQAADARRR 

ff? ™ KI DYLKDKVQPT FVKERRAMKRE YEEFKVRINALVAKAQ 

AKAQ 

KMDYLKNKVH PA FVRERRAMKRDYEEFKVR I NSLVATAQ 



SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 

SEQ ID NO: 22 KI DYLKDKVAAS FVRERRAMKRE YEEFKVRINALVAKAQ 

SEQ ID NO: 23 KIDYLKDKIQ PS FVKERRAMKRE YEEFKVRINALVAKAQ 

SEQ ID NO: 24 KMDYLKNKVH PAFVRERRAMKRDYEEFKVKINALVATAQ- 

SEQ ID NO: 25 KIDYLKDKVH PS FVKERRAMKRE YEEFKVRINALVAKAQ 

SEQ ID NO: 26 KIDPTKNKSRIDFVKDRRKIKREYDEFKVRINGLPDSIRRRSDAFNAREE 

SEQ ID NO: 27 KVDYLQDKVH PT FVKERRAMKREYEEFKVRINAQVAKAS 

SEQ ID NO:28 KIDYLKDKVQTS FVKERRAMKRE YEEFKVRVNGLVAKAQ 

SEQ ID NO: 29 KMDYLKNKVH PAFVRERRAMKRDYEEFKVKINALVATAQ 



601 



SEQ ID NO: 2 



660 



SEQ ID NO: 4 KVPEEGWTMQDGTPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 6 

SEQ ID NO: 8 KVPEEGWTMQDGSPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 10 KVPEEGWTMQDGTPWPG NNVRDHPGMIQVFL 

™ 22 ! H RRGAADDHAGVVQVLI DFA 

SEQ ID NO: 14 KVPQGGWIMQDGTPWPG NNTKDHPGMIQVFL— 

SEQ ID NO: 16 KMPEEGWTMQDGT PWPG NNPRDHPGMIQVFL 

SEQ ID NO: 18 KVPEDGWTMQDGTPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 20 

SEQ ID NO: 22 KVPEEGWTMQDGSPWPG 

SEQ ID NO: 23 KI PEEGWTMQDGTPWPG 

SEQ ID NO: 24 KVPEEGWTMQDGTPWPG 



NNTRDHPGMIQVFL 

NNVRDHPGMIQVFL 

NNTRDHPGMIQVYL- 



SEQ ID NO: 25 KKPEEGWVMQDGT PWPG 

SEQ ID NO: 26 MKALKQMRESGGDPTEPVKVPKATW-MADGTHWPGTWAASTREHSKGDHAGILQ^LKPP 

SEQ ID NO: 27 KVPLEGWIMQDGTPWPG NNTKDHPGMIQVFL 

SEQ ID NO: 28 KVPEEGWIMQDGTPWPG NNTRDHPGMIQVFL—- 

SEQ ID NO:29 KVPEDGWTMQDGTPWPG NSVRDH PGMIQVFL 



661 



SEQ ID NO: 2 



720 



SEQ ID NO: 4 G-QSGGHDVE GNEL PRLV YVS REKRPG YNHH KKAGAMNALVRVS AVLTNA 

SEQ ID NO: 6 

11% " N0:8 G-QSGGRDVE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLSNA 

l~ !l ? G-QSGGLDCE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLTNA 

SEQ ID NO: 12 GSVPQLGVANGSKLIDVASVDVCLPALVYVCREKRRGHAHHRKAGAMNA 



4/8 



WO 00/04166 



PCT/US99/15871 



SEQ ID NO: 14 
SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO: 26 
SEQ ID NO: 27 
SEQ ID NO: 28 
SEQ ID NO: 29 



SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 
SEQ ID NO: 8 
SEQ ID NO: 10 
SEQ ID NO: 12 
SEQ ID NO: 14 
SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO: 26 
SEQ ID NO: 27 
SEQ ID NO: 28 
SEQ ID NO: 29 



SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 
SEQ ID NO: 8 
SEQ ID NO: 10 
SEQ ID NO: 12 
SEQ ID NO: 14 
SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO: 26 
SEQ ID NO: 27 
SEQ ID NO: 28 
SEQ ID NO: 29 



Figure 1 (cont'd.) 

•G-SSGGLDTE GNQLPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLTNA 

G-HSGGLDTD GNELPRLVYVSREKRPGFQHHKKAGAMNALIRVSAVLTNG 

G-QDGVRDVE GNELPRLVYVSREKRPGFDHHKKAGAMNALVRASAI ITNA 



G-HSGGLDTD GNELPRLIYVSREKRPGFQHHKKAGAMNALIRVSAVLTNG 

G-HSGVRDTD GNELPRLVYVSREKRPGFDHHKKAGAMNSLIRVSAVLSNA 

G-SAGALDVD GKELPRLVYVS REKRPG YQHHKKAGAENALVRVSAVLTNA 

SS DPLIG-NS DDKVI DFS DT DTRLPMFVYVS REKRPG YDHNKKAGAMNALVRASAILSNG 

G-HSGGFDVE GHELPRLVYVS REKRPG FQHHKKAGAMNALVRVAGVLTNA 

G-QSGGLDAE GNEL PRL VY VS REKR PG FQH HKKAGAMNALVRVS AVLTNG 

G-S DGVRDVE NNELPRLVYVS REKRPG FDHHKKAGAMNSL I RVSGVLSNA 



PYLLNLDCDHYINNSKAIKEAMCFMMDPLLGKK VCYVQFPQRFDGIDRHDRYAN 



AYLLNLDCDHYINNSKAIKEAMCFMMDPLVGKK-- VCYVQFPQRFDGI DKNDRYAN 

PYLLNLDCDHYINNSKAIKEAMCFMMDPLLGKK VCYVQFPQRFDGIDRHDRYAN 

PFILDLDCDYYVNNSQALRAGICFMIERGGGGAAEDAGAVAFVQFPQRVDGVDPGDRYAN 

PFMLNLDCDHYVNNSKAAREAMCFLMDPQTGKK VCYVQFPQRFDGIDTHDRYAN 

AYLLNVDCDHYFNNSKALKEAMCFMMDPVLGKK TCYVQFPQRFDGIDLHDRYAN 

PYLLNVDCDHYINNSKALREAMCFMMDPQLGKK VCYVQFPQRFDGIDRHDRYSN 

EAMCFLMDPNLGPQ VC YVQFPQRFDG I DRNDR YAN 



AYLLNVDCDHYFNNSKAIKEAMCFMMDPAIGKK — CCYVQFPQRFDGIDLHDRYAN 

PYLLNVDCDHYINNSKAIRESMCFMMDPQSGKK VCYVQFPQRFDGIDRHDRYSN 

PFILNLDCDHYINNSKAMREAMCFLMDPQFGKK LCYVQFPQRFDGIDRHDRYAN 

PFILNLDCDHYIYNCKAVREGMCFMMDRG-GED ICYIQFPQRFEGIDPSDRYAN 

PFMLNLDCDHYVNNSKAVREAMCFLMDPQIGKK VC YVQFPQRFDG I DTNDRYAN 

AFLLNLDCDHYINNSKALREAMCFLMDPNLGKQ VCYVQFPQRFDGI DRNDR YAN 

PYLLNVDCDHYINNSKALREAMCFMMDPQSGKK ICYVQFPQRFDGIDRHDRYSN 



RNWFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCI 



RNWFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCL 

RNWFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCF 

HNRVLFDCTELGLDGLQGPIYVGTGCLFRRVALYSVDLPR 

RNTVFFDINMKGLDGIQGPVYVGTGCVFRRQALYGYNPPKGPK — RPKMVSCDC 

RNIVFFDINMKGQDGVQGPVYVGTGCCFNRQALYGYDPVLTEEDLE PNIIV 

RNWFFDINMKGLDGIQGPIYVGTGCVFRRYALYGYDAP AKKKPPSKTCNCWPKWCC 

RNTVFFDINLRGLDGIQGPVYVGTGCVFNRTAIYGYEPPIKAK — K PGFLA 



RNIVFFDINMKGLDGIQGPVYVGTGCCFNRQALYGYDPVLTEEDLE PNIIV 

RNWFFDINMKGLDGIQGPIYVGTGCVFRRQALYGFDAP — -KKKKPPGKTCNCWPKWCC 

RNWFFDINMLGLDGLQGPVYVGTGCVFNRQALYGYDPPVSEKRPK MTCDCWPSWCC 

NNTVFFDGNMRALDGVQGPVYVGTGTMFRRFALYGFDPP 

RNTVFFDINMKGLDGIQGPVYVGTGCVFKRQALYGYEPPKGPK — RPKMISCGC 

RNTVFFDI NLRGLDG I QG PVYVGTGCVFNRTAL YG YE PPLKPKHRK TGILS 

RNWFFDINMKGLDGLQGPIYVGTGCVFRRQALYGFDAP KKKKG PRKTCNCWPKWCL 
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Figure 1 (cont'd.) 



841 



900 



SEQ ID NO: 2 

SEQ ID NO: 4 CCCCFGNRKTKKKTKTSKP KFEKIKKLF-KKKENQAPAYALGEIDEA — APG- 

SEQ ID NO: 6 

SEQ ID NO: 8 SCCC SRNKNKKKTTKP KTEKKKRLFFKKAENPSPAYALGEIDEG — APG- 

SEQ ID NO: 10 CCCC FGNRKQKK TTKP KTEKKKLLFFKKEENQS PAYALGE I DEA — APG - 

SEQ ID NO: 12 

SEQ ID NO: 14 -CPCFGSRKKYKE KNDANGEAASLKG 

SEQ ID NO: 16 - KSCCGSRKKGKGGNKK YS-DKKKAMGR — TESTVPIFNMEDIEEGVEG — Y 

SEQ ID NO: 18 LCC GSRKKKN ANS KKEKKRKV — KHSEASKQIHALENIEAGN — EG- 

SEQ ID NO: 20 -SLCXG-KKKASKSKKR SSDKKKSNKH — VDSSVPVFNLEDIEEGVEGAGF 

SEQ ID NO: 22 

SEQ ID NO: 23 -KSCCGSRKKGKSS-KK YNYEKRRGINR — SDSNAPLFNMEDIDEGFEG — Y 

SEQ ID NO: 24 LCC GLRKKSK T KAKDKKT — NTKETSKQIHALENVDEGVIVPV- 

SEQ ID NO: 25 "CCCGGSRKKSKKKGEKKGLLGGLLYGKKKKMMGKNYVKKGSAPVFDLEEIEEGLEG--Y 

SEQ ID NO: 26 NPDKLLEKKESETEALTTSDFDPDLDVTQLPKRFGNSTLL AESIPI 

SEQ ID NO: 27 -CPCFGRRRKNKK FS KN DMNG DVAALGG 

SEQ ID NO: 28 - S LCGGS RKKS S KS S KK GSDKKKSGKH — VDSTVPVFNLEDIEEGVEGAGF 

SEQ ID NO: 29 LCF GSRKNRK AKT VAADKKK KNREASKQI HALENI EEGRGHKV- 

901 960 
SEQ ID NO: 2 

SEQ ID NO: 4 AENEKASIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 

SEQ ID NO: 6 ET 

SEQ ID NO: 8 ADIEKAGIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 

SEQ ID NO: 10 AENEKAGIVNQQKLEKKFGQSSVFVTSTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 

SEQ ID NO: 12 

SEQ ID NO: 14 MDDDKEVLMSQMNFEKKFGQSSIFVTSTLMEEGGVPPSSSPAALLKEAIHVISCGYEDKT 

SEQ ID NO: 16 DD-ERTLLMSQKSLEKRFGQSPVFIAATFMEQGGIPPSTNPATLLKEAIHVISCGYEDKT 

SEQ ID NO: 18 TNNEKTSNLTQTKLEKRFGQSPVFVASTLLDDGGVPHGVSPASLLKEAIQVISCGYEDKT 

SEQ ID NO: 20 DD-EKSVLMSQMSLEKRFGQSAAFVASTLMEYGGVPQSSTPESLLKEAIHVISCGYEDKS 

SEQ ID NO: 22 

SEQ ID NO: 23 DD-ERSILMSQRSVEKRFGQSPVFIAATFMEQGGIPPTTNPATLLKEAIHVISCGYEDKT 

SEQ ID NO: 24 SNVEKRSEATQLKLEKKFGQSPVFVASAVLQNGGVPRNASPACLLREAIQVISCGYEDKT 

SEQ ID NO: 25 EELEKSTLMSQKNFEKRFGQSPVFIASTLMENGGLPEGTNSTSLIKEAIHVISCGYEEKT 

SEQ ID NO: 26 AEFQGRPLADHPAV — KYGRPP GALR VPRDPLDATTVAESVSVISCWYEDKT 

SEQ ID NO: 27 AEGDKEHLMFEMNFEKTFGQSSIFVTSTLMEEGGVPPSSSPAVLLKEAIHVISCGYEDKT 

SEQ ID NO: 28 DD-EKSLLMSQMSLEKRFGQSAVFVASTLMENGGVPQSATPETLLKEAIHVISCGYEDKT 

SEQ ID NO: 29 LNVEQSTEAMQMKLQKKYGQSPVFVASARLENGGMARNASPACLLKEAIQVISRGYEDKT 

961 1020 

SEQ ID NO: 2 HEDILTGFKMHARGWISIYCMPPRPCFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 4 GWGKDIGWIYGSVTEDILTGFKMHCHGWRSIYCIPKRAAFKGSAPLNLSDRLHQVLRWAL 

SEQ ID NO: 6 EWGKEIGWIYGSVTEDILTGFKMHCRGWKSVYCTPTRPAFKGSAPINLSDRLHQVLRWAL 

SEQ ID NO: 8 DWGKEIGWIYGSITEDILTGFKMHCHGWRSIYCIPKRPAFKGSAPLNLSDRLHQVLRWAL 

SEQ ID NO: 10 DWGKEIGWIYGSVTEDILTGFKMHCHGWRS I YCIPKRVAFKGSAPLNLS DRLHQVLRWAL 

SEQ ID NO: 12 WRP RRSL 

SEQ ID NO: 14 EWGLELGWIYGSITEDILTGFKMHCRGWRSIYCMPKRAAFKGTAPINLSDRLNQVLRWAL 

SEQ ID NO: 16 EWGKEIGWIYGSVTEDILTGFKMHARGWISIYCMPPRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 18 EWGKEVGWI YGS VTEDI LTG FKMHCHGWRS VYC I PKRPAFKGSAP I NLS DRLHQVLRWAL 

SEQ ID NO: 20 EWGTEIGWIYGSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 22 

SEQ ID NO: 23 EWGKEIGWIYGSVTEDILTGFKMHARGWISIYCNPPRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 24 EWGKEIGWIYGSVTEDILTGFKMHCHGWRSVYCMPKRAAFKGSAP I NLS DRLHQVLRWAL 

SEQ ID NO: 25 EWGKEIGWIYGSVTEDILTGFKMHCRGWKSVYCVPKRPAFKGSAPINLSDRLHQVLRWAL 

SEQ ID NO: 26 EWGDRVGWIYGSVTEDWTGYRMHNRGWRSVYCITKRDSFRGSAPINLTDRLHQVLRWAT 
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Figure 1 (cont'd.) 

SEQ ID NO: 27 EWGTELGWIYGSITEDILTGFTCMHCRGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 28 DWGSEIGWIYGSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 29 EWGKEIGWIYGSVTEDILTGSKMHSHGWRHVYCTPKLAAFKGSAPINLSDRLHQVLRWAL 

1021 1080 

SEQ ID NO: 2 GSVEILFSRHCPIWYNYGG-RLKLLERMAYINTIVYPITSLPLIAYCVLPAICLLTNKFI 

SEQ ID NO: 4 GSIEIFFSNHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 

SEQ ID NO: 6 GSVEIFMSRHCPLWYAYGG-RLKWLERFAYTNTIVYPFTSIPLLAYCTIPAVCLLTGKFI 

SEQ ID NO: 8 GSVEIFFSKHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 

SEQ ID NO: 10 GSIEIFFSNHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 

SEQ ID NO: 12 G CRL 

SEQ ID NO: 14 GSIEIFFSHHCPLWYGFKEKKLKWLERFAYANTTVYPFTSIPLVAYCILPAVCLLTDKFI 

SEQ ID NO: 16 GS I E I FLS RHC PLW YG YNG- KLKPLMRLAY INTIVYPFTSI PLI AYCTLPAFCLLTNKFI 

SEQ ID NO: 18 GSVEIFFSRHCPIWYGYGGG-LKLLERFSYINSWYPWTSLPLLVYCTLPAICLLTGKFI 

SEQ ID NO: 20 GSVEILFSRHCPLWYGYGG-RLKFLERFAYINTTIYPLTSLPLLVYCILPAICLLTGKFI 

SEQ ID NO: 22 

SEQ ID NO: 23 GSIEILLSRHCPIWYGYHG-RLRLLERIAYINTIVYPITSIPLIAYCILPAFCLITDRFI 

SEQ ID NO: 24 GSVEIFLSRHCPIWYGYGGG-LKWLERFSYINSVVYPWTSLPLIVYCSLPAVCLLTGKFI 

SEQ ID NO: 25 GS VE I FLS RHCPLW YG YGG- KLKWLERLAY I NT I VY P FTS I PLLAYCT I PAVCLLTGKFI 

SEQ ID NO: 26 GSVEI FFSRNNAI LASKRLKFLQRLAYLNVGI YPFTSLFLILYCFLPAFSLFSGQFI 

SEQ ID NO: 27 GSVEIFFSRHSPLWYGYKGGKLKWLERFAYANTTIYPFTSIPLLAYCILPAICLLTDKFI 

SEQ ID NO: 28 GSVEI LFSRHCPIWYGYSG-RLKWLERFAYVNTTIYPVTAIPLLMYCTLPAVCLLTNKFI 

SEQ ID NO: 29 GSVEI FLSRHCPIWYGYGGG-LKWLERLSYINSWYPWTSLPLIVYCSLPAICLLTGKFI 

1081 1140 

SEQ ID NO: 2 I PEI SNYAGMFFI LMFAS I FATGILELRWSGVGI EDWWRNEQFWVIGGTSAHLFAVFQGL 

SEQ ID NO: 4 T PELNNVAS LWFMS L FI C I FATS I LEMRWS G VG I DDWWRNEQ FWV I GG VS S HL FAVFQG L 

SEQ ID NO: 6 IPTLNNLASIWFIALFLSIIATSVLELRWSGVSIEDWWRNEQFWVIGGVSAHLFAVFQGF 

SEQ ID NO: 8 TPELTNVASIWFMALFICISVTG I LEMRWSGVAI DDWWRNEQ FWVIGGVSAHLFAVFQGL 

SEQ ID NO: 10 TPELNNVASLWFMSLFICIFATS I LEMRWSGVG I DDWWRNEQ FWVIGGVSSHLFAVFQGL 

SEQ ID NO: 12 

SEQ ID NO: 14 MPPI ST FAGLYFVALFSS 1 1 ATGILELKWSGVS IEEWWRNEQFWVI GGVSAHLFAVIQGL 

SEQ ID NO: 16 I PEI SNFASMWFILLFVS I FTTS I LELRWSGVS I EDWWRNEQFWVIGGTSAHLFAVFQGL 

SEQ ID NO: 18 VPEISNYASLVFMALFISIAATGILEMQWGGVSIDDWWRNEQFWVIGGVSSHLFALFQGL 

SEQ ID NO: 20 MPEISNLASIWFIALFLSIFATGILEMRWSGVGIDEWWRNEQFWVIGGISAHLFAVFQGL 

SEQ ID NO: 22 

SEQ ID NO: 23 I PEI SNYASIWFILLFISIAVTGI LELRWSGVS I EDWWRNEQFWVIGGTSAHLFAVFQGL 

SEQ ID NO: 24 VPEISNYAGILFMLMFISIAVTGILEMQWGGVGIDDWWRNEQFWVIGGASSHLFALFQGL 

SEQ ID NO: 25 I PTLSNLTSVWFLAL FLS 1 1 ATGVLELRWSGVS I QDWWRNEQ FWVIGGVSAHLFAVFQGL 

SEQ ID NO: 26 VRTLSISFLVYLLMITICLIGLAVLEVKWSGIGLEEWWRNEQWWLISGTSSHLYAWQGV 

SEQ ID NO: 27 MPPI STFASLFFI SLFMSI I VTGILELRWSGVS IEEWWRNEQFWVIGGI SAHLFAWQGL 

SEQ ID NO: 28 I PQI SNLAS I WFISLFLSI FATGI LKMKWNGVGI DQWWRNEQFWVIGGVSAHLFAVFQGL 

SEQ ID NO: 29 VPEISNYASILFMALFSSIAITGILEMQWGKVGIDDWWRNEQFWVIGGVSAHLFALFQGL 

1141 1200 

SEQ ID NO: 2 LKVLAGIDTNFTVTSKANDEDGD — FAELYVFKWTSLLIPPTTVLVINLVGMVAGISYAI 

SEQ ID NO: 4 LKVIAGVDTSFTVTSKGGDD — EE-FSELYTFKWTTLLIPPTTLLLLNFIGWAGISNAI 

SEQ ID NO: 6 LKVLGGVDTS FTVTSKAAGDEADA- FGDLYLFKWTTLLVPPTTLI I INMVG I VAGVSDAV 

SEQ ID NO: 8 LKVFAGIDTSFTVTSKAGDD — EE-FSELYTFKWTTLLIPPTTLLLLNFIGWAGISNAI 

SEQ ID NO: 10 LKVIAGVDTSFTVTSKGGDD — EE-FSELYTFKWTTLLIPPTTLLLLNFIGWAGVSNAI 

SEQ ID NO: 12 

SEQ ID NO: 14 LKVLAGIDTNFTVTSKATDDE-E — FGELYTFKWTTLLIPPTTILIINIVGWAGISDAI 

SEQ ID NO: 16 LKVLAGIDTNFTVTSKASDEDGD — FAELYVFKWTSLLIPPTTVLIVNLVGIVAGVSYAI 

SEQ ID NO: 18 LKVLAGVNTNFTVTSKAADD — GE- FSELYI FKWTSLLI PPMTLLIMNI VGVWGI SDAI 

SEQ ID NO: 20 LKVLAGI DTNFTVTSKANDEEGD — FAELYMFKWTTLLI PPTTILIINMVGWAGTSYAI 

SEQ ID NO: 22 

SEQ ID NO: 23 LKVLAGI DTNFTVTSKATDEDGD — FAELYIFKWTALLIPPTTVLLVNLIGIVAGVSYAV 
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Figure 1 (cont'd.) 

SEQ ID NO: 24 LKVLAGVNTNFTVTSKAADD — GA-FSELYIFKWTTLLIPPTTLLIINIIGVIVGVSDAI 

SEQ ID NO: 25 LKVLAGVDTNFTVTAKAADDTE FGELYLFKWTTLLIPPTTLIILNMVGVVAGVSDAI 

SEQ ID NO: 26 LKVIAGIEISFTLTTKSGGDDNEDIYADLYIVKWSSLMIPPIVIAMVNIIAIWAFIRTI 

SEQ ID NO: 27 LKILAGIDTNFTVTSKATDDD-D — FGELYAFKWTTLLIPPTTVLIINIVGWAGISDAI 

SEQ ID NO: 28 LKVLAGIDTNFTVTSKASDEDGD — FAELYMFKWTTLLIPPTTLLIINLVGWAGISYVI 

SEQ ID NO: 29 LKVLAGVDTNFTVTSKAADD — GE-FSDLYLFKWTSLLIPPMTLLIINVIGVIVGVSDAI 

1201 1260 

SEQ ID NO: 2 NSGYQSWGPLFGKLFFSIWVILHLYPFLKGLMGKQNRTPTIVIVWSILLASIFSLLWVKI 

SEQ ID NO: 4 NNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRI 

SEQ ID NO: 6 NNGYGSWGPLFGKLFFSFWVIVHLYPFLKGLMGRQNRTPTIWLWSILLASIFSLVWVRI 

SEQ ID NO: 8 NNGYESWGPLFGKLFFAFWVI VHLYPFLKGLVGRQNRTPTI VI VWS I LLAS I FSLLWVRV 

SEQ ID NO: 10 NNGYESWGPLFGKLFFAFWVI VHLYPFLKGLVGRQNRTPTI VI VWS I LLAS I FSLLWVRI 

SEQ ID NO: 12 LGEDER LWSRM 

SEQ ID NO: 14 NNG YQS WGPL FG KLFFSFWVIVHLYPFLKGLMGRQNRTPT I WIWSVLLAS I FSLLWVRI 

SEQ ID NO: 16 NSGYQSWGPLFGKLFFAIWVIAHLYPFLKGLLGRQNRTPTIVIVWSVLLAS I FSLLWVRI 

SEQ ID NO: 18 NNGYDSWGPLFGRLFFALWVILHLYPFLKGLLGKQDRMPTIILVWSILLASILTLMWVRI 

SEQ ID NO: 20 NSGYQSWGPLFGKLFFAFWVIVHLYPFLKGLMGRQNRTPTIVIVWAVLLASIFSLLWVRV 

SEQ ID NO: 22 

SEQ ID NO: 23 NSGYQSWGPLFGKLFFALWVIAHLYPFLKGLLGRQNRTPTIVIVWSVLLAS I FSLLWVRI 

SEQ ID NO: 24 SNGYDSWGPLFGRLFFALWVIVHLYPFLKGMLGKQDKMPTIIWWSILLASILTLLWVRV 

SEQ ID NO: 25 NNGYGSWGPLFGKLFFAFWVILHLYPFLKGLMGRQNRTPTIVVLWSILLASIFSLVWVRI 

SEQ ID NO: 26 YQAVPQWSKLIGGAFFSFWVLAHLYPFAKGLMGRRGKTPTIVFVWAGLIAITISLLWTAI 

SEQ ID NO: 27 NNG YQSWG PL FGKLFFSFWV I VHLYPFLKGLMGRQNRTPT I WIWSVLLAS I FSLLWVRI 

SEQ ID NO: 28 NSGYQSWGPLFGKLFFAFWVIIHLYPFLKGLMGRQNRTPTIVWWSILLASIFSLLWVRI 

SEQ ID NO: 29 SNGYDSWGPLFGRLFFALWVIIHLYPFLKGLLGKQDRMPTIIVVWSILLASILTLLWVRV 

1261 1284 

SEQ ID NO: 2 DPFISDTQKAVAM-GQCGVNC 

SEQ ID NO: 4 DP FLAKDDG PLL — EECGLDCN — 

SEQ ID NO: 6 DPFIPKAKGPILKP-C-GVEC 

SEQ ID NO: 8 DPFLAKSNGPLL — EECGLDCN — 

SEQ ID NO: 10 DPFLAKDDGPLL — EECGLDCN — 

SEQ ID NO: 12 KQMVILSGPR 

SEQ ID NO: 14 DPFVLKTKGPDTKL — CGINC 

SEQ ID NO: 16 DPFTSDSNKLT — NGQCGINC 

SEQ ID NO: 18 NPFVSRD-GPVL — EICGLNCDES 

SEQ ID NO: 20 DPFTTRLAGPNI — QTCGINC 

SEQ ID NO: 22 

SEQ ID NO: 23 NPFVDANPNANNFNGKGGVF 

SEQ ID NO: 24 NPFVAKG-GPVL — EICGLNCGN- 

SEQ ID NO: 25 DPFLPKQTGPVLKQ-C-GVEC 

SEQ ID NO: 26 NP NTG PAAAAEG VGGGG FQ FP 

SEQ ID NO: 27 DPFVLKTKGPDTSK — CGINC 

SEQ ID NO: 28 DPFTTRVTGPDV — EQCGINC 

SEQ ID NO: 29 NPFVAKG-GPIL — EICGLDC — L 
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SEQUENCE LISTING 

<110> E. I. du Pont de Nemours and Company 

<120> Plant Cellulose Synthases 

<130> BB-1170 

<140> 
<141> 

<150> 60/092,844 
<151> July 14, 1998 

<160> 29 

<170> Microsoft Office 97 

<210> 1 
<211> 1221 
<212> DNA 

<213> Hordeum vulgare 
<400> 1 

gcacgaggat attcttactg ggtttaaaat gcacgcaaga ggttggatat caatctaccg 60 

catgccacca cgaccttgtt tcaagggttc tgcgccaatc aatctctctg accgtctcaa 120 

tcaagttctc cggtgggctc ttgggtcagt tgaaattctg tttagcagac attgtcctau 180 

ctggtacaat tacggtgggc ggttgaaact tctggagagg atggcttaca tcaacaccat 240 

tgtttatcca ataacatccc ttccacttat cgcctattgt gtgcttcctg ctatctgtct 300 

cctcaccaac aaatttatca ttcccgagat cagtaactat gctgggatgt tctttattct 360 

tatgtttgcc tccatctttg ccacgggtat attggagctg cgatggagtg gtgtcggcat 420 

cgaggactgg tggagaaacg agcagttctg ggttattggt ggcacatctg cccatctttt 4 80 

cgcagtgttc cagggtctgc tgaaggtgtt ggccgggatt gacaccaact tcacggttac 540 

ctcgaaggca aacgacgagg atggcgattt tgctgagtta tacgtgttca agtggaccag 600 

tctcctcatt cctccgacca ccgtccttgt gattaacctg gtgggcatgg tggcaggcat 660 

atcatatgcc atcaacagcg gttaccagtc ttggggtcca ctcttcggaa agctcttctt 720 

ctcaatctgg gtgatcctcc atctctaccc cttcctcaag ggtctcatgg ggaagcagaa 780 

ccgcacgcca accatcgtca ttgtttggtc catcctccta gcctccatct tctccctccr 840 

gtgggtgaag atcgaccctt tcatatccga tacccagaaa gccgtcgcca tggggcagtg 900 

tggcgtcaac tgctgatcgg cgccgaagag tatctgcccc cctcgtgtaa ataccggagg 960 

gggttggatg ggattttgtt gttgtagatg aagacggagt tttatgtaag ttattattgc 1020 

cccttcgtgc tgagaagcac aaaccgtgaa gcctacgaaa cctgcagcgt acattgtgat 1080 

ttttttctcc ttttcttttc atctgtgata cctgttgttt cttcttagag tatattatgt 1140 

cagaacgtat ctatagttct atacacacta tgacaccaac tatttatata aggcagctgt 1200 

tgcatcaact cttctgcaaa a 1221 

<210> 2 

<211> 304 

<212> PRT 

<213> Hordeum vulgare 

<400> 2 

His Glu Asp He Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp He 
1 5 10 15 

Ser He Tyr Cys Met Pro Pro Arg Pro Cys Phe Lys Gly Ser Ala Pro 
20 25 30 

He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly 
35 40 45 
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Ser Val Glu lie Leu Phe Ser Arg His Cys Pro lie Trp Tyr Asn Tyr 
50 55 60 

Gly Gly Arg Leu Lys Leu Leu Glu Arg Met Ala Tyr He Asn Thr He 
65 70 75 80 

Val Tyr Pro He Thr Ser Leu Pro Leu He Ala Tyr Cys Val Leu Pro 
85 90 95 

Ala He Cys Leu Leu Thr Asn Lys Phe He He Pro Glu He Ser Asn 
100 105 HO 

Tyr Ala Gly Met Phe Phe He Leu Met Phe Ala Ser He Phe Ala Thr 
115 120 125 

Gly He Leu Glu Leu Arg Trp Ser Gly Val Gly He Glu Asp Trp Trp 
130 135 140 

Arg Asn Glu Gin Phe Trp Val He Gly Gly Thr Ser Ala His Leu Phe 
145 150 155 160 

Ala Val Phe Gin Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn 
165 170 175 

Phe Thr Val Thr Ser Lys Ala Asn Asp Glu Asp Gly Asp Phe Ala Glu 
180 185 * 190 

Leu Tyr Val Phe Lys Trp Thr Ser Leu Leu He Pro Pro Thr Thr Val 
195 200 205 

Leu Val He Asn Leu Val Gly Met Val Ala Gly He Ser Tyr Ala He 
210 215 220 

Asn Ser Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe 
225 230 235 240 

Ser He Trp Val He Leu His Leu Tyr Pro Phe Leu Lys Gly Leu Met 
245 250 * 255 

Gly Lys Gin Asn Arg Thr Pro Thr He Val He Val Trp Ser lie Leu 
260 265 270 

Leu Ala Ser He Phe Ser Leu Leu Trp Val Lys He Asp Pro Phe He 
275 280 285 

Ser Asp Thr Gin Lys Ala Val Ala Met Gly Gin Cys Gly Val Asn Cys 
290 295 300 

<210> 3 

<211> 3776 

<212> DNA 

<213> Zea mays 

<400> 3 

gcgcgccgcg caggcgcaac gcaacaaagg gaaaccccag ccggaggagc aaaagctagc 60 

aagcgtgtcc ctccccctcc ctcactcccg tttcattcca ttccccccca gacgccgcta 120 

ccgccgccgc cgcacgcacg cttgccccgg gatctggaga tctggtagcg ccagggggat 180 

ggaggccagc gccgggctgg tcgccggctc gcacaaccgg aacgagctcg tcgtcatccg 240 

ccgcgatggc gagccagggc cgaagcccat ggaccagcgg aacggccagg tgtgccagat 300 
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ttgcggcgac gacgtggggc gcaaccccga cggggagccg ttcgtggcct gcaacgagtg 360 
cgccttcccc atctgccggg actgctacga gtacgagcgc cgcgagggca cgcagaactg 420 
cccccagtgc aagacccgct tcaagcgcct caaggggtgc gcgcgcgtgc ccggggacga 480 
ggaggaggac ggcgtcgacg acctggagaa cgagttcaac tggagcgaca agcacgactc 540 
ccagtacctc gccgagtcca tgctccacgc ccacatgagc tacggccgcg gcgccgacct 600 
cgacggcgtg ccgcagccat tccaccccat ccccaatgtt cccctcctca ccaacggaca 660 
gatggtcgat gacatcccgc cggaccagca cgcccttgtg ccctcgttcg tgggtggcgg 720 
ggggaagagg attcaccctc tcccgtacgc ggatcccaac cttcctgtgc aaccgaggtc 780 
tatggaccct tccaaggatc tcgccgcata tggctacggg agcgtagcat ggaaggagag 840 
gatggagagc tggaagcaga agcaggagag gatgcaccag acgaggaacg atggcggcgg 900 
cgatgatggt gatgatgcag atctaccact aatggatgaa gctagacagc cattgtccag 960 
aaagatcccg cttccttcaa gccaaatcaa cccctatagg atgattataa taattcggct 1020 
agtggttttg tgtttcttct tccactaccg agtgatgcat ccggtgcctg atgcatttgc 1080 
tttatggctc atatctgtga tctgtgaaat ttggtttgcc atgtcttgga ttcttgacca 1140 
gtttccaaag tggtttccta tcgagaggga aacctatctt gaccggctga gtttaaggtt 1200 
tgacaaggaa gggcatcctt ctcaactcgc ccctgttgat ttctttgtca gtacggttga 1260 
tcccttgaag gaacctccat tggtcactgc taatactgtt ctatctatcc tttcggtgga 1320 
ttatccagtt gataaggttt catgctacgt ttctgatgat ggtgctgcca tgctgacatt 1380 
tgaagcattg tctgaaacat ctgaatttgc aaagaaatgg gttcctttct gcaaaagata 1440 
tagccttgag cctcgtgctc cagagtggta cttccaacag aagatagact acctgaaaga 1500 
caaggtggcg ccaaactttg ttagagaacg gagagcaatg aagagagagt atgaggaatt 1560 
caaggtcaga atcaatgcct tggttgctaa agcccaaaag gttcctgaag aaggatggac 1620 
aatgcaggat ggaactccat ggcccggaaa taatgtccgt gatcatcctg gaatgattca 1680 
ggttttcctt ggtcaaagtg gtggccatga tgtggaagga aatgagctgc ctcgattggt 1740 
ttatgtttca agagaaaaac ggccaggcta caaccatcac aagaaggctg gtgctatgaa 1800 
tgcattggtc cgagtctctg ctgtactaac taatgctcct tatttgctga acttggattg 1860 
tgatcactat atcaataata gtaaggctat aaaggaagca atgtgtttta tgatggatcc 1920 
tttgcttgga aagaaagttt gctatgtgca gtttcctcaa agatttgatg ggattgatcg 1980 
ccatgatcga tatgctaaca gaaatgttgt ctttttcgat atcaacatga aaggtttgga 2040 
tggtatccag ggcccaattt atgtgggtac tggatgtgtc ttcagaaggc aggcattata 2100 
tggctacgat gctcccaaaa caaagaagcc accatcaaga acttgcaact gctggccaaa 2160 
gtggtgcatt tgctgttgct gttttggtaa caggaagacc aagaagaaga ccaagacctc 2220 
taaacctaaa tttgagaaga taaagaaact ttttaagaaa aaggaaaatc aagcccctgc 2280 
atatgctctt ggtgaaattg atgaagccgc tccaggagct gaaaatgaaa aggctagtat 2340 
tgtaaatcaa cagaagttgg aaaagaaatt tggccagtct tcagtttttg ttgcatccac 2400 
acttcttgag aatggtggaa ccctgaagag tgccagtcca gcttctcttc tgaaggaagc 24 60 
tatacatgtc atcagttgtg gatatgaaga caaaacaggc tggggaaaag atattggttg 2520 
gatttatgga ccagtcacag aagatattct tactgggttt aagatgcact gccatggttg 2580 
gcggtcaatt tactgcatac ctaaacgggc cgccttcaaa ggttccgcac ctctcaatct 264 0 
ttccgatcgt cttcaccagg ttcttcggtg ggctcttggt tcaattgaaa ttttcttcag 2700 
caaccactgc cctctctggt atgggtatgg tggtggacta aagttcctgg aaaggttttc 2760 
gtacattaac tccatcgtat acccttggac atctatcccg ctcttggcct attgcacatt 2820 
gcctgccatc tgcttgctga cagggaaatt tatcacgcca gagcttaaca atgttgccag 2880 
cctctggttc atgtcacttt tcatctgcat ttttgctacg agcatcctgg aaatgagatg 2940 
gagtggtgta ggcatcgatg actggtggag aaacgagcag ttttgggtca ttggaggcgt 3000 
gtcttcacat ctctttgctg tgttccaggg actcctcaag gtcatagctg gtgtagacac 3060 
gagcttcact gtgacatcca agggcggaga cgacgaggag ttctcagagc tgtacacatt 3120 
caaatggacg acccttctga tacctccgac aaccctgctc ctactgaact tcattggagt 3180 
ggtagctggc atctccaatg cgatcaacaa cggatatgaa tcatggggcc ccctgttcgg 3240 
gaagctcttc tttgcatttt gggtgatcgt ccatctttac ccgttcctca agggtctggt 3300 
tgggaggcag aacaggacgc caacgattgt cattgtctgg tccatcctcc tggcttcgat 3360 
cttctcgctg ctttgggtcc ggatcgaccc gttccttgcg aaggatgatg gtcccctgtt 3420 
ggaggagtgt ggtctggatt gcaactagga ggtcagcacg tggacttccc cgtcagtgtg 3480 
tggtcgaaga agtatttttg cagatgtttt gtgcccatat ttcttttttc aatttttgtc 3540 
cctctgtaga tagaaacaag gggagaaggg gaaaaaaagt acttgtattt cttttgttcc 3600 
atggtggtgg tggtggtggg cggctcagcc tcgtgagtgc agtattgggc aaaccggagg 3660 
ctgcggcaac cttgtgcagt tcggccacga atatactagg gaagatcgcg accaatcaat 3720 
caatcgatga ccgagttcaa ttgttcagca aaaaaaaaaa aaaaaaaaaa aaaaaa 3776 
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<210> 4 

<211> 1148 

<212> PRT 

<213> Zea mays 

<400> 4 

Arg Ala Ala Gin Ala Gin Arg Asn Lys Gly Lys Pro Gin Pro Glu Glu 
* 5 10 15 

Gin Lys Leu Ala Ser Val Ser Leu Pro Leu Pro His Ser Arg Phe lie 
20 25 30 

Pro Phe Pro Pro Arg Arg Arg Tyr Arg Arg Arg Arg Thr His Ala Cys 
35 40 45 

Pro Gly He Trp Arg Ser Gly Ser Ala Arg Gly Met Glu Ala Ser Ala 
50 55 60 

Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu Leu Val Val He Arg 
65 70 75 80 

Arg Asp Gly Glu Pro Gly Pro Lys Pro Met Asp Gin Arg Asn Gly Gin 
85 90 95 

Val Cys Gin He Cys Gly Asp Asp Val Gly Arg Asn Pro Asp Gly Glu 
100 105 HO 

Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro He Cys Arg Asp Cys 
115 120 125 

Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gin Asn Cys Pro Gin Cys Lys 
130 135 140 

Thr Arg Phe Lys Arg Leu Lys Gly Cys Ala Arg Val Pro Gly Asp Glu 
145 150 155 ' 160 

Glu Glu Asp Gly Val Asp Asp Leu Glu Asn Glu Phe Asn Trp Ser Asp 
165 170 175 

Lys His Asp Ser Gin Tyr Leu Ala Glu Ser Met Leu His Ala His Met 
180 185 190 

Ser Tyr Gly Arg Gly Ala Asp Leu Asp Gly Val Pro Gin Pro Phe His 
195 200 205 

Pro He Pro Asn Val Pro Leu Leu Thr Asn Gly Gin Met Val Asp Asp 
210 215 220 

He Pro Pro Asp Gin His Ala Leu Val Pro Ser Phe Val Gly Gly Gly 
225 230 235 240 

Gly Lys Arg He His Pro Leu Pro Tyr Ala Asp Pro Asn Leu Pro Val 
245 250 255 

Gin Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr Gly Tyr 
260 265 270 

Gly Ser Val Ala Trp Lys Glu Arg Met Glu Ser Trp Lys Gin Lys Gin 
275 280 285 
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Glu Arg Met His Gin Thr Arg Asn Asp Gly Gly Gly Aso Asp Gly Asp 
290 295 300 * 

Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin Pro Leu Ser Arg 
305 310 315 320 

Lys He Pro Leu Pro Ser Ser Gin He Asn Pro Tyr Arg Met He He 
325 330 ' 335 

He He Arg Leu Val Val Leu Cys Phe Phe Phe His Tyr Arg Val Met 
340 345 350 

His Pro Val Pro Asp Ala Phe Ala Leu Trp Leu He Ser Val He Cys 
355 360 365 

Glu He Trp Phe Ala Met Ser Trp He Leu Asp Gin Phe Pro Lys Tro 
370 375 380 

Phe Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Phe 
385 390 395 400 

Asp Lys Glu Gly His Pro Ser Gin Leu Ala Pro Val Asd Phe Phe Val 
405 410 415 

Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val Thr Ala Asn Thr 
420 425 430 

Val Leu Ser He Leu Ser Val Asp Tyr Pro Val Asp Lys Val Ser Cys 
435 440 445 

Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu Ala Leu Ser 
450 455 460 

Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Cys Lys Arg Tyr 
*65 470 475 480 

Ser Leu Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin Gin Lys He Asp 
485 490 495 

Tyr Leu Lys Asp Lys Val Ala Pro Asn Phe Val Arg Glu Arg Arg Ala 
500 505 "* 510 

Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Leu Val 
515 520 525 

Ala Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr Met Gin Asp Gly 
530 535 540 

Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly Met He Gin 
545 550 555 * 560 

Val Phe Leu Gly Gin Ser Gly Gly His Asp Val Glu Gly Asn Glu Leu 
565 570 575 

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Tyr Asn His 
580 585 590 



His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Ala Val 
595 600 605 
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Leu Thr Asn Ala Pro Tyr Leu Leu Asn Leu Asp Cys Asp His Tyr lie 
610 615 620 

Asn Asn Ser Lys Ala He Lys Glu Ala Met Cys Phe Met Met Asp Pro 
625 630 635 640 

Leu Leu Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 
645 650 655 

Gly lie Asp Arg His Asp Arg Tyr Ala Asn Arg Asn Val Val Phe Phe 
660 665 670 

Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro He Tyr Val 
675 680 685 

Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Asp Ala 
690 695 700 

Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn Cys Trp Pro Lys 
705 710 715 720 

Trp Cys lie Cys Cys Cys Cys Phe Gly Asn Arg Lys Thr Lys Lys Lvs 
725 730 " ' 735 * 

Thr Lys Thr Ser Lys Pro Lys Phe Glu Lys He Lys Lys Leu Phe Lys 
740 745 750 

Lys Lys Glu Asn Gin Ala Pro Ala Tyr Ala Leu Gly Glu He Asp Glu 
755 760 765 

Ala Ala Pro Gly Ala Glu Asn Glu Lys Ala Ser He Val Asn Gin Gin 
770 775 780 

Lys Leu Glu Lys Lys Phe Gly Gin Ser Ser Val Phe Val Ala Ser Thr 
785 790 795 800 

Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala Ser Leu 
805 810 815 

Leu Lys Glu Ala lie His Val He Ser Cys Gly Tyr Glu Asp Lys Thr 
820 825 830 

Gly Trp Gly Lys Asp lie Gly Trp He Tyr Gly Ser Val Thr Glu Asp 
835 840 845 

lie Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser lie Tyr 
850 855 860 



Cys He Pro Lys Arg Ala Ala Phe Lys Gly Ser Ala Pro Leu Asn Leu 
865 870 875 880 

Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser lie Glu 
885 890 895 

He Phe Phe Ser Asn His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Gly 
900 905 910 

Leu Lys Phe Leu Glu Arg Phe Ser Tyr lie Asn Ser lie Val Tyr Pro 
915 920 925 
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Trp Thr Ser lie Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala He Cys 
930 935 940 

Leu Leu Thr Gly Lys Phe He Thr Pro Glu Leu Asn Asn Val Ala Ser 
945 950 955 960 

Leu Trp Phe Met Ser Leu Phe He Cys He Phe Ala Thr Ser lie Leu 
965 970 975 

Glu Met Arg Trp Ser Gly Val Gly He Asp Asp Trp Trp Arg Asn Glu 
980 985 990 

Gin Phe Trp Val He Gly Gly Val Ser Ser His Leu Phe Ala Val Phe 
995 1000 1005 

Gin Gly Leu Leu Lys Val He Ala Gly Val Asp Thr Ser Phe Thr Val 
1010 1015 1020 

Thr Ser Lys Gly Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr Thr Phe 
1025 1030 1035 1040 

Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu Leu Leu Asn 
1045 1050 1055 

Phe He Gly Val Val Ala Gly He Ser Asn Ala He Asn Asn Gly Tyr 
1060 1065 1070 

Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val 
1075 1080 1085 

lie Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg Gin Asn 
1090 1095 H00 

Arg Thr Pro Thr lie Val lie Val Trp Ser lie Leu Leu Ala Ser lie 
1105 1110 ins H20 

Phe Ser Leu Leu Trp Val Arg lie Asp Pro Phe Leu Ala Lys Asp Asp 
1125 H30 " H35 

Gly Pro Leu Leu Glu Glu Cys Gly Leu Asd Cys Asn 
1140 H45 



<210> 5 

<211> 1189 

<212> DNA 

<213> Zea mays 



<400> 5 

gcacgagacc gagtggggca aggagattgg 
cctgacgggg ttcaagatgc actgccgggg 
gccggcgttc aaggggtcgg cgcccatcaa 
ctgggcgctg gggtccgtgg agatcttcat 
cggcggccgg ctcaagtggc tggagcgctt 
cacctccatc ccgctcctcg cctactgcac 
gttcatcatt cccacgctga acaacctcgc 
catcatcgcg acgagcgtcc tggagctgcg 
gcgcaacgag cagttctggg tcatcggcgg 
gggcttcctc aaggttctgg gcggcgtgga 
cggcgacgag gccgacgcct tcggggacct 
gccccccacc acgctcatca tcatcaacat 



gtggatctat gggtcggtga cagaggatat 60 

gtggaagtcc gtgtactgca cgccgacacg 120 

cttgtctgat cgtctccacc aggtgctgcg 180 

gagccgccac tgcccgctct ggtacgccta 240 

cgcctacacc aacaccatcg tgtacccctt 300 

catccccgcc gtctgcctgc tcaccggcaa 360 

cagcatctgg ttcatcgcgc tcttcctgtc 420 

gtggagcggg gtgagcatcg aggactggtg 480 

cgtgtccgcg catctcttcg ccgtgttcca 540 

caccagcttc accgtcacct ccaaggcggc 600 

ctacctcttc aagtggacca ccctgctggt 660 

ggtgggcatc gtggccggcg tgtccgacgc 720 
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cgtcaacaac ggctacggct cctggggccc gctcttcggc aagctcttct tctccttctg 780 

ggtcatcgtc cacctctacc cgttcctcaa ggggctcatg gggaggcaga accggacgcc 840 

caccatcgtc gtgctctggt ccatcctcct cgcctccatc ttctcgctcg tctgggtcag 900 

gatcgacccg tttatcccga aggccaaggg ccccatcctc aagccatgcg gagtcgagtg 960 

ctgagctcac ctagctacct tcttgttgca tgtacggacg ccgccgtgcg tttggacata 1020 

caggcacttt tgggccaggc tactcatgtt cgactttrtt tttaattttg tacaagattt 1080 

gtgatcgagt gactgagtga gacagagtgt tgggtgtaag aactgtgatg gaattcactc 1140 

aaattaatgg acattttttt tcttcaactg caaaaaaaaa aaaaaaaaa H89 

<210> 6 

<211> 320 

<212> PRT 

<213> Zea mays 

<400> 6 

His Glu Thr Glu Trp Gly Lys Glu He- Gly Trp lie Tyr Gly Ser Val 
15 10 15 

Thr Glu Asp lie Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Lvs 
20 25 30 

Ser Val Tyr Cys Thr Pro Thr Arg Pro Ala Phe Lys Gly Ser Ala Pro 
35 40 45 

He Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly 
50 55 60 

Ser Val Glu He Phe Met Ser Arg His Cys Pro Leu Trp Tyr Ala Tyr 
65 70 75 * 80 

Gly Gly Arg Leu Lys Trp Leu Glu Arg Phe Ala Tyr Thr Asn Thr He 
85 90 95 

Val Tyr Pro Phe Thr Ser He Pro Leu Leu Ala Tyr Cys Thr He Pro 
100 105 no 

Ala Val Cys Leu Leu Thr Gly Lys Phe He He Pro Thr Leu Asn Asn 
115 120 125 

Leu Ala Ser He Trp Phe He Ala Leu Phe Leu Ser He He Ala Thr 
130 135 140 

Ser Val Leu Glu Leu Arg Trp Ser Gly Val Ser He Glu Asp Trp Trp 
145 150 155 160 

Arg Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe 
165 170 175 

Ala Val Phe Gin Gly Phe Leu Lys Val Leu Gly Gly Val Asp Thr Ser 
180 185 190 

Phe Thr Val Thr Ser Lys Ala Ala Gly Asp Glu Ala Asp Ala Phe Gly 
195 200 205 

Asp Leu Tyr Leu Phe Lys Trp Thr Thr Leu Leu Val Pro Pro Thr Thr 
210 215 220 

Leu He He He Asn Met Val Gly He Val Ala Gly Val Ser Asp Ala 
225 230 235 240 
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Val Asn Asn Gly Tyr Gly Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe 
245 250 255 

Phe Ser Phe Trp Val lie Val His Leu Tyr Pro Phe Leu Lys Gly Leu 
260 265 270 

Met Gly Arg Gin Asn Arg Thr Pro Thr He Val Val Leu Trp Ser He 
275 280 285 

Leu Leu Ala Ser lie Phe Ser Leu Val Trp Val Arg He Asp Pro Phe 
290 295 300 

He Pro Lys Ala Lys Gly Pro He Leu Lys Pro Cys Gly Val Glu Cvs 
305 310 315 320 

<210> 7 

<211> 3786 

<212> DNA 

<213> Zea mays 

<400> 7 

ccacagctca tataccaaga gccggagcag crtagcgcag cccagagcgg cgccgcgcca 60 
agcacaaccc ccacccgcca cagccgcgtg cgcatgtgag cggtcgccgc ggccgggaga 120 
ccagaggagg ggaggactac gtgcatttcg ctgtgccgcc gccgcggggt tcgtgcgcga 180 
gcgagatccg gcggggcggg gcggggggcc tgagatggag gctagcgcgg ggctggtggc 24 0 
cggctcgcat aaccggaacg agctggtggt gatccgccgc gaccgcgagt cgggagccgc 300 
gggcggcggc gcggcgcgcc gggcggaggc gccgtgccag atatgcggcg acgaggtcgg 360 
ggtgggcttc gacggggagc ccttcgtggc gtgcaacgag tgcgccttcc ccgtctgccg 420 
cgcctgctac gagtacgagc gccgcgaggg ctcgcaagcg tgcccgcagt gcaggacccg 480 
ctacaagcgc ctcaagggct gcccgcgggt ggccggcgac gaggaggagg acggcgtcga 540 
cgacctggag ggcgagttcg gcctgcagga cggcgccgcc cacgaggacg acccgcagta 600 
cgtcgccgag tccatgctca gggcgcagat gagctacggc cgcggcggcg acgcgcaccc 660 
cggcttcagc cccgtcccca acgtgccgct cctcaccaac ggccagatgg ttgatgacat 720 
cccgccggag cagcacgcgc tcgtgccgtc ctacatgagc ggcggcggcg gcgggggcaa 780 
gaggatccac ccgctccctt tcgcagatcc caaccttcca gtgcaaccga gatccatgga 840 
cccgtccaag gatctggccg cctacggata tggcagcgtg gcctggaagg agagaatgga 900 
gggctggaag cagaagcagg agcgcctgca gcatgtcagg agcgagggtg gcggtgattg 960 
ggatggcgac gatgcagatc tgccactaac ggatgaagct aggcagccat tgtccagaaa 1020 
agtccctata tcatcaagcc gaattaatcc ctacaggatg attatcgtta tccggttggt 1080 
ggttttgggt ttcttcttcc actaccgagt gatgcatccg gcgaaagatg catttgcatt 1140 
gtggctcata tctgtaatct gtgaaatctg gtttgcgatg tcctggattc ttgatcagtt 1200 
cccaaagtgg cttccaatcg agagagagac ttacctggac cgtttgtcac taaggtttga 1260 
caaggaaggt caaccctctc agcttgctcc aatcgacttc tttgtcagta cggttgatcc 1320 
cacaaaggaa cctcccttgg tcacagcgaa cactgtcctt tccatccttt ctgtggatta 1380 
tccggttgag aaggtctcct gctatgtttc tgatgatggt gctgcaatgc ttacgtttga 1440 
agcattgtct gaaacatctg aatttgcaaa gaaatgggtt cctttcagca aaaagtttaa 1500 
tatcgagcct cgtgctcctg agtggtactt ccaacagaag atagactacc tgaaagacaa 1560 
ggttgctgct tcatttgtta gggagaggag ggcgatgaag agagaatacg aggaattcaa 1620 
ggtaaggatc aatgccttgg ttgcaaaagc ccaaaaggtt cctgaggaag gatggacaat 1680 
gcaagatgga agcccctggc ctggaaacaa cgtacgcgat catcctggaa tgattcaggt 1740 
attccttggc caaagtggcg gtcgtgatgt ggaaggaaat gagttgcctc gcctggttta 1800 
tgtctcgaga gaaaagaggc caggttataa ccatcacaag aaggctggtg ccatgaatgc 1860 
actggtccgt gtctctgctg tcttatcaaa tgctgcatac ctattgaact tggactgtga 1920 
tcactacatc aacaatagca aggccataaa agaggctatg tgtttcatga tggatccttt 1980 
ggtggggaag aaagtgtgct atgtacagtt ccctcagagg tttgatggta ttgacaaaaa 2040 
tgatcgatac gctaacagga acgttgtctt ttttgacatc aacatgaaag gtttggacgg 2100 
tattcaagga cccatttatg tgggtactgg atgtgttttc agacggcagg cactgtatgg 2160 
ttatgatgct cctaaaacga agaagccacc atcaagaact tgcaactgct ggcccaagtg 2220 
gtgcctctct tgctgctgca gcaggaacaa gaataaaaag aagactacaa aaccaaagac 2280 
ggagaagaag aaaagattat ttrtcaagaa agcagaaaac ccatctcctg catatgcttt 2340 
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gggtgaaatt gatgaaggtg ctccaggtgc tgatatcgag aaggccggaa tcgtaaatca 2400 
acagaaacta gagaagaaat ttgggcagtc ttctgttttt gtcgcatcaa cacttcttga 2460 
gaacggaggg accctgaaga gcgcaagtcc agcttctctt ctgaaggaag ctatacatgt 2520 
tatcagctgc ggctacgaag acaagaccga ctggggaaaa gagattggct ggatttacgg 2580 
atcgatcaca gaggatatct tgactggatt taagatgcac tgccatggct ggcggtctat 2640 
ttactgcatc ccgaagcggc ctgcattcaa aggttctgcg cctctgaacc tttccgaccg 2700 
tcttcaccag gtccttcgct gggcccttgg gtccgtcgaa attttcttca gcaagcactg 2760 
cccactttgg tacggatacg gcggcgggct aaaattcctg gaaaggtttt cttatatcaa 2820 
ctccatcgtt tatccctgga cgtccattcc tctcctggct tactgtacct tgcctgccat 2880 
ctgcctgctc acggggaagt ttatcacacc agagcttacc aatgtcgcca gtatctggtt 2940 
catggcactt ttcatctgca tctccgtgac cggcatcctg gaaatgaggt ggagtggcgt 3000 
ggccatcgac gactggtgga ggaacgagca gttctgggtc atcggaggcg tttcggcgca 3060 
tctgttcgcg gtgttccagg gcctgctgaa ggtgttcgcc ggcatcgaca cgagcttcac 3120 
cgtgacgtcg aaggccgggg acgacgagga gttctcggag ctgtacacgt tcaagtggac 3180 
caccctgctg atacccccga ccacgctcct cctgctgaac ttcatcgggg tggtggccgg 3240 
gatctcgaac gcgatcaaca acgggtacga gtcgtggggc cccctgttcg ggaagctctt 3300 
cttcgccttc tgggtgatcg tccacctgta cccgttcctc aagggtctgg tggggaggca 3360 
gaacaggacg ccgacgatcg tcatcgtctg gtccatcctg ctggcctcga tcttctcgct 3420 
cctgtgggtc cgcgtcgacc cgttcctcgc caagagcaac ggcccgctcc tggaggagtg 3480 
tggcctggac tgcaactgaa gtgggggccc cctgtcactc gaagttctgt cacgggcgaa 3540 
ttacgcctga ttttttgttg ttgttgttgt tggaattctt tgctgtagat agaaaccaca 3600 
tgtccacggc atctctgctg tgtccattgg agcaggagag aggtgcctgc tgctgtttgt 3660 
tgagtaaatt aaaagtttta aagttataca gtgatgcaca ttccagtgcc cagtgtattc 3720 
cctttttaca gtctgtatat tagcgacaaa ggacatattg gttaggagtt tgattctttt 3780 
gtaaaa 3786 

<210> 8 

<211> 1165 

<212> PRT 

<213> Zea mays 

<400> 8 

His Ser Ser Tyr Thr Lys Ser 
1 5 

Ala Pro Arg Gin Ala Gin Pro 
20 

Glu Arg Ser Pro Arg Pro Gly 
35 

Phe Arg Cys Ala Ala Ala Ala 
50 55 

Gly Arg Gly Gly Gly Pro Glu 
65 70 

Gly Ser His Asn Arg Asn Glu 
85 

Ser Gly Ala Ala Gly Gly Gly 
100 

Gin lie Cys Gly Asp Glu Val 
115 

Val Ala Cys Asn Glu Cys Ala 
130 135 



Arg Ser Ser Leu Ala Gin Pro Arg Ala 
10 15 

Pro Pro Ala Thr Ala Ala Cys Ala Cys 
25 30 

Asp Gin Arg Arg Glv Gly Leu Ara Ala 
40 45 

Gly Phe Val Arg Glu Arg Asp Pro Ala 
60 

Met Glu Ala Ser Ala Gly Leu Val Ala 
75 80 

Leu Val Val He Arg Arg Asp Arg Glu 
90 95 

Ala Ala Arg Arg Ala Glu Ala Pro Cys 
105 110 

Gly Val Gly Phe Asp Gly Glu Pro Phe 
120 125 



Phe Pro Val Cys Arg Ala Cys Tyr Glu 
140 
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Tyr Glu Arg Arg Glu Gly Ser Gin Ala Cys Pro Gin Cys Arg Thr Arg 
145 150 155 160 

Tyr Lys Arg Leu Lys Gly Cys Pro Arg Val Ala Gly Asp Glu Glu Glu 
165 170 175 

Asp Gly Val Asp Asp Leu Glu Gly Glu Phe Gly Leu Gin Asp Gly Ala 
180 185 190 

Ala His Glu Asp Asp Pro Gin Tyr Val Ala Glu Ser Met Leu Arg Ala 
195 200 205 

Gin Met Ser Tyr Gly Arg Gly Gly Asp Ala His Pro Gly Phe Ser Pro 
210 215 220 

Val Pro Asn Val Pro Leu Leu Thr Asn Gly Gin Met Val Asp Asp He 
225 230 235 240 

Pro Pro Glu Gin His Ala Leu Val Pro Ser Tyr Met Ser Gly Gly Gly 
245 250 255 

Gly Gly Gly Lys Arg He His Pro Leu Pro Phe Ala Asp Pro Asn Leu 
260 265 270 

Pro Val Gin Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr 
275 280 285 

Gly Tyr Gly Ser Val Ala Trp Lys Glu Arg Met Glu Gly Trp Lys Gin 
290 295 300 

Lys Gin Glu Arg Leu Gin His Val Arg Ser Glu Gly Gly Gly Asp Trp 
305 310 315 ' 320 

Asp Gly Asp Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin Pro 
325 330 335 

Leu Ser Arg Lys Val Pro He Ser Ser Ser Arg He Asn Pro Tyr Arg 
340 345 350 

Met He He Val He Arg Leu Val Val Leu Gly Phe Phe Phe His Tyr 
355 360 365 

Arg Val Met His Pro Ala Lys Asp Ala Phe Ala Leu Trp Leu He Ser 
370 375 380 

Val He Cys Glu He Trp Phe Ala Met Ser Trp He Leu Asp Gin Phe 
385 390 395 * 400 

Pro Lys Trp Leu Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser 
405 410 415 

Leu Arg Phe Asp Lys Glu Gly Gin Pro Ser Gin Leu Ala Pro He Asp 
420 425 430 

Phe Phe Val Ser Thr Val Asp Pro Thr Lys Glu Pro Pro Leu Val Thr 
435 440 445 



Ala Asn Thr Val Leu Ser He Leu Ser Val Asp Tyr Pro Val Glu Lys 
450 455 460 
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Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu 
465 470 475 480 

Ala Leu Ser Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Ser 
485 490 495 

Lys Lys Phe Asn He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin Gin 
500 505 510 

Lys He Asp Tyr Leu Lys Asp Lys Val Ala Ala Ser Phe Val Arg Glu 
515 520 525 

Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn 
530 535 540 

Ala Leu Val Ala Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr Met 
545 550 555 *' ' 560 

Gin Asp Gly Ser Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly 
565 570 575 

Met He Gin Val Phe Leu Gly Gin Ser Gly Gly Ara Asp Val Glu Gly 
580 585 ' 590 

Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly 
595 600 605 

Tyr Asn His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val 
610 615 620 

Ser Ala Val Leu Ser Asn Ala Ala Tyr Leu Leu Asn Leu Asp Cys Asp 
625 630 635 640 

His Tyr He Asn Asn Ser Lys Ala lie Lys Glu Ala Met Cys Phe Met 
645 650 655 

Met Asp Pro Leu Val Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin 
660 665 670 

Arg Phe Asp Gly He Asp Lys Asn Asp Arg Tyr Ala Asn Arg Asn Val 
675 680 685 

Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro 
690 695 700 

lie Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly 
705 710 715 720 

Tyr Asp Ala Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn Cys 
725 730 " 735 

Trp Pro Lys Trp Cys Leu Ser Cys Cys Cys Ser Arg Asn Lys Asn Lys 
740 745 750 

Lys Lys Thr Thr Lys Pro Lys Thr Glu Lys Lys Lys Arg Leu Phe Phe 
755 760 765 



Lys Lys Ala Glu Asn Pro Ser Pro Ala Tyr Ala Leu Gly Glu lie Asp 
770 775 780 
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Glu Gly Ala Pro Gly Ala Asp lie Glu Lys Ala Sly He Val Asn Gin 
785 790 795 800 

Gin Lys Leu Glu Lys Lys Phe Gly Gin Ser Ser Val Phe Val Ala Ser 
805 810 815 

Thr Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala Ser 
820 825 830 

Leu Leu Lys Glu Ala He His Val lie Ser Cys Gly Tyr Glu Asp Lys 
835 840 845 

Thr Asp Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser He Thr Glu 
850 855 860 

Asp He Leu Thr Gly Phe Lys Met His- Cys His Gly Trp Arg Ser He 
865 870 875 ' 880 

Tyr Cys He Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro Leu Asn 
885 890 895 

Leu Ser Asp Arg Leu His Gin Val Leu Arg Tro Ala Leu Gly Ser Val 
900 905 910 

Glu He Phe Phe Ser Lys His Cys Pro Leu Trp Tyr Gly Tvr Glv Glv 
915 920 925 

Gly Leu Lys Phe Leu Glu Arg Phe Ser Tyr He Asn Ser He Val Tyr 
930 935 940 

Pro Trp Thr Ser He Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala He 
945 950 955 960 

Cys Leu Leu Thr Gly Lys Phe He Thr Pro Glu Leu Thr Asn Val Ala 
965 970 975 

Ser lie Trp Phe Met Ala Leu Phe He Cys He Ser Val Thr Gly He 
980 985 990 

Leu Glu Met Arg Trp Ser Gly Val Ala He Asp Asp Trp Trp Arg Asn 
995 1000 1005 

Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala Val 
1010 1015 1020 

Phe Gin Gly Leu Leu Lys Val Phe Ala Gly He Asp Thr Ser Phe Thr 
1025 1030 1035 1040 

Val Thr Ser Lys Ala Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr Thr 
1045 1050 1055 

Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu Leu Leu 
1060 1065 1070 

Asn Phe He Gly Val Val Ala Gly He Ser Asn Ala He Asn Asn Gly 
1075 1080 1085 

Tyr Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp 
1090 1095 iioo 
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Val He Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg Gin 
1105 1110 ins " H20 

Asn Arg Thr Pro Thr He Val He Val Trp Ser He Leu Leu Ala Ser 
1125 H30 U35 

He Phe Ser Leu Leu Trp Val Arg Val Asp Pro Phe Leu Ala Lys Ser 
1140 H45 H50 

Asn Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn 
1155 1160 H65 

<210> 9 
<211> 3936 
<212> DNA 
<213> Zea mays 

<400> 9 

cttctccctc gtcggtgcgg cgtggcgcgg ctcggcgttc ggtgagaaac cactcggggg 60 
atgaggatct gctgctagag tgagaggagc tacggtcagt atcctctgcc ttcgtcggcg 120 
gcggaagtgg aggggaggaa gcgatggagg cgagcgccgg gctggtggcc ggctcccaca 180 
accgcaacga gctcgtcgtc atccgccgcg acggcgatcc cgggccgaag ccgccgcggg 240 
agcagaacgg gcaggtgtgc cagatttgcg gcgacgacgt cggccttgcc cccggcgggg 300 
accccttcgt ggcgtgcaac gagtgcgcct tccccgtctg ccgggactgc tacgaatacg 360 
agcgccggga gggcacgcag aactgccccc agtgcaagac tcgatacaag cgcctcaagg 420 
gctgccaacg tgtgaccggt gacgaggagg aggacggcgt cgatgacctg gacaacgagt 480 
tcaactggga cggccatgac tcgcagtctg tggccgagtc catgctctac ggccacatga 540 
gctacggccg tggaggtgac cctaatggcg cgccacaagc tttccagctc aaccccaatg 600 
ttccactcct caccaacggg caaatggtgg atgacatccc accggagcag cacgcgctgg 660 
tgccttcttt catgggtggt gggggaaaga ggatacatcc ccttccttat gcggatccca 720 
gcttacctgt gcaacccagg tctatggacc catccaagga tcttgctgca tatgggtatg 780 
gtagtgttgc ttggaaggaa cggatggaga attggaagca gagacaagag aggatgcacc 840 
agacggggaa tgatggtggt ggtgatgatg gtgacgatgc tgatctacca ctaatggatg 900 
aagcaagaca acaactgtcc aggaaaattc cacttccatc aagccagatt aatccatata 960 
ggatgattat cattattcgg cttgtggttt tggggttctt cttccactac cgagtgatgc 1020 
atccggtgaa tgatgcattt gctttgtggc tcatatctgt tatctgtgaa atctggtttg 1080 
ccatgtcttg gattcttgat caattcccaa agtggttccc tattgagaga gagacttacc 1140 
tagaccggct gtcactgagg ttcgacaagg aaggccagcc atctcaactt gctccaattg 1200 
atttctttgt cagtacggtt gatcccttaa aggaacctcc tttggtcaca acaaatactg 1260 
ttctatctat cctttcggtg gattatcctg ttgataaggt ttcttgctat gtttctgatg 1320 
atggtgctgc aatgctaacg tttgaagcat tatctgaaac atctgaattt gcaaagaaat 1380 
gggttccttt ctgcaaacgg tacaatattg aacctcgcgc tccagagtgg tacttccaac 14 40 
agaagataga ctacttgaaa gacaaggtgg cagcaaactt tgttagggag aggagagcaa 1500 
tgaagagaga gtatgaggaa ttcaaggtga gaatcaatgc cttagttgcc aaagcccaga 1560 
aagttcctga agaaggatgg acaatgcaag atggaacccc ctggcctgga aacaatgttc 1620 
gtgatcatcc tggaatgatt caggtcttcc ttggccaaag cggaggcctt gactgtgagg 1680 
gaaatgaact gccacgattg gtttatgttt ctagagagaa acgaccaggc tataaccatc 1740 
ataagaaagc tggtgctatg aatgcattgg tccgagtctc tgctgtacta acaaatgctc 1800 
catatttgtt aaacttggat tgtgatcact acatcaacaa cagcaaggct ataaaggaag 1860 
caatgtgttt tatgatggac cctttactag gaaagaaggt ttgctatgta cagttccctc 1920 
aaagatttga tgggattgat cgccatgacc gatatgctaa ccggaatgtt gtcttttttg 1980 
atatcaacat gaaaggtttg gatggtattc agggtccaat ttatgttggt actggatgtg 2040 
tatttagaag gcaggcatta tatggttatg atgcccccaa aacaaagaag ccaccatcaa 2100 
ggacttgcaa ctgctggccc aagtggtgct tttgctgttg ctgctttggc aataggaagc 2160 
aaaagaagac taccaaaccc aaaacagaga agaaaaagtt attatttttc aagaaagaag 2220 
agaaccaatc ccctgcatat gctcttggtg aaattgacga agctgctcca ggagctgaga 2280 
atgaaaaggc cggtattgta aatcaacaaa aattagaaaa gaaatttggc caatcttctg 2340 
tttttgttac atccacactt ctcgagaatg gtggaacctt gaagagtgca agtcctgctt 2400 
ctcttttgaa agaagctata catgtcatta gttgtggtta tgaagacaag acagactggg 24 60 
gaaaagagat tggctggatc tatggatcag ttacagaaga tattctaact ggtttcaaga 2520 
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tgcattgtca tggttggcgg tcaatttact gcatacctaa acgggttgca ttcaaaggtt 2580 
ctgcacctct gaatctttca gatcgtcttc accaggtgct tcggtgggct cttgggtcta 2640 
ttgagatctt cttcagcaat cattgccctc tttggtatgg gtatggtggc ggtctgaaat 2700 
ttttggaaag attttcctac atcaactcca tcgtgtatcc ttggacatct attcccctct 2760 
tggcttactg tacattgcct gccatctgtt tattgacagg gaaatttatc actccagagc 2820 
tgaataatgt tgccagcctg tggttcatgt cactttttat ctgcattttt gctacgagca 2880 
tcctagaaat gagatggagt ggtgttggaa ttgatgactg gtggaggaat gagcagttct 2940 
gggtcattgg aggtgtgtcc tcacacctct ttgctgtgtt ccagggactt ctcaaggtca 3000 
tagctggtgt tgatacaagc ttcaccgtga catcaaaggg tggagatgat gaggagttct 3060 
cagagctata tacattcaaa tggactacct tattgatacc tcctaccacc ttgcttctat 3120 
tgaacttcat tggtgtggtc gctggcgttt caaatgcgat caataacgga tatgagtcat 3180 
ggggccccct ctttgggaag ctattctttg cattttgggt gattgtccat ctttatccct 3240 
ttctcaaagg tttggttgga aggcaaaaca ggacaccaac gattgtcatc gtctggtcca 3300 
ttctgctggc ttcaatcttc tcgctccttt gggttcggat tgatcctttc cttgcgaagg 3360 
atgatggtcc gcttcttgag gagtgtggtt tggattgcaa ctaggatgtc agtgcatcag 3420 
ctcccccaat ctgcatatgc ttgaagtata ttttctggtg tttgtcccca tattcagtgt 3480 
ctgtagataa gagacatgaa atgtcccaag tttcttttga tccatggtga acctacttaa 3540 
tatctgagag atatactggg ggaaaatgga ggctgcggca atccttgtgc agttgggccg 3600 
tggaatacag catatgcaag tgtttgattg tgcagcattc tttattactt ggtcgcaata 3660 
tagatgggct gagccgaaca gcaaggtatt ttgattctgc actgctcccg tgtacaaact 3720 
tggttctcaa taaggcaggc aggaatgcat ctgccagtgg aacagagcaa cctgcacatt 3780 
atttatgtat gcctgttcat tggagggctt gttcattaca tgttcgtcta tactagaaaa 3840 
aacagaatat tagcattaat ctatagttaa ttaaagtatg taaatgcgcc tgttttttgt 3900 
tgtgtactgt aatcatctga gttggttttg tgaaaa 3936 

<210> 10 

<211> 1086 

<212> PRT 

<213> Zea mays 

<400> 10 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
15 10 15 

Leu Val Val He Arg Arg Asp Gly Asp Pro Gly Pro Lys Pro Pro Arg 
20 25 30 

Glu Gin Asn Gly Gin Val Cys Gin He Cys Gly Asp Asp Val Gly Leu 
35 40 45 

Ala Pro Gly Gly Asp Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 
50 55 60 

Val Cys Arg Asp Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gin Asn 
65 70 75 80 

Cys Pro Gin Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Cys Gin Arg 
85 90 ' 95 

Val Thr Gly Asp Glu Glu Glu Asp Gly Val Asp Asp Leu Asp Asn Glu 
100 105 HO 

Phe Asn Trp Asp Gly His Asp Ser Gin Ser Val Ala Glu Ser Met Leu 
115 120 125 

Tyr Gly His Met Ser Tyr Gly Arg Gly Gly Asp Pro Asn Gly Ala Pro 
130 135 140 

Gin Ala Phe Gin Leu Asn Pro Asn Val Pro Leu Leu Thr Asn Gly Gin 
I 45 150 155 160 
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Met Val Asp Asp He Pro Pro Glu Gin His Ala Leu Vai Pro Ser Phe 
165 170 175 

Met Gly Gly Gly Gly Lys Arg He His Pro Leu Pro Tyr Ala Asp Pro 
180 185 190 

Ser Leu Pro Val Gin Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala 
195 200 205 

Ala Tyr Gly Tyr Gly Ser Val Ala Trp Lys Glu Arg Met Glu Asn Trp 
210 215 220 

Lys Gin Arg Gin Glu Arg Met His Gin Thr Gly Asn Asd Gly Gly Gly 
225 230 235 240 

Asp Asp Gly Asp Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin 
245 250 255 

Gin Leu Ser Arg Lys He Pro Leu Pro Ser Ser Gin He Asn Pro Tyr 
260 265 270 

Arg Met He He He He Arg Leu Val Val Leu Gly Phe Phe Phe His 
275 280 285 

Tyr Arg Val Met His Pro Val Asn Asp Ala Phe Ala Leu Trp Leu He 
290 295 300 

Ser Val lie Cys Glu He Trp Phe Ala Met Ser Trp He Leu Asp Gin 
305 310 315 320 

Phe Pro Lys Trp Phe Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu 
325 330 335 

Ser Leu Arg Phe Asp Lys Glu Gly Gin Pro Ser Gin Leu Ala Pro He 
340 345 350 

Asp Phe Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val 
355 360 365 

Thr Thr Asn Thr Val Leu Ser He Leu Ser Val Asp Tyr Pro Val Asp 
370 375 380 

Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe 
385 390 395 400 

Glu Ala Leu Ser Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe 
405 410 ' L 415 

Cys Lys Arg Tyr Asn He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin 
420 425 430 

Gin Lys He Asp Tyr Leu Lys Asp Lys Val Ala Ala Asn Phe Val Arg 
435 440 445 

Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He 
450 455 460 

Asn Ala Leu Val Ala Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr 
465 470 475 ' 480 
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Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro 
485 4 90 4 95 

Gly Met lie Gin Val Phe Leu Gly Gin Ser Gly Gly Leu Asp Cys Glu 
500 505 510 

Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro 
515 520 525 

Gly Tyr Asn His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg 
530 535 540 

Val Ser Ala Val Leu Thr Asn Ala Pro Tyr Leu Leu Asn Leu Asp Cys 
545 550 555 560 

Asp His Tyr lie Asn Asn Ser Lys Ala lie Lys Glu Ala Met Cys Phe 
565 570 575 

Met Met Asp Pro Leu Leu Gly Lys Lys Val Cys Tyr Val Gin Phe Pro 
580 585 " " 590 

Gin Arg Phe Asp Gly lie Asp Arg His Asp Arg Tyr Ala Asn Arg Asn 
595 600 605 

Val Val Phe Phe Asp lie Asn Met Lys Gly Leu Asp Gly He Gin Gly 
610 615 J 620 

Pro He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr 
625 630 635 640 

Gly Tyr Asp Ala Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn 
645 650 " 655 

Cys Trp Pro Lys Trp Cys Phe Cys Cys Cys Cys Phe Gly Asn Arg Lys 
660 665 670 

Gin Lys Lys Thr Thr Lys Pro Lys Thr Glu Lys Lys Lys Leu Leu Phe 
675 680 685 

Phe Lys Lys Glu Glu Asn Gin Ser Pro Ala Tyr Ala Leu Gly Glu He 
690 695 700 

Asp Glu Ala Ala Pro Gly Ala Glu Asn Glu Lys Ala Gly He Val Asn 
705 710 715 720 

Gin Gin Lys Leu Glu Lys Lys Phe Gly Gin Ser Ser Val Phe Val Thr 
725 730 735 

Ser Thr Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala 
740 745 750 

Ser Leu Leu Lys Glu Ala He His Val He Ser Cys Gly Tyr Glu Asp 
755 760 765 

Lys Thr Asp Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr 
770 775 780 



Glu Asp He Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser 
785 790 795 " ' 800 
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He Tyr Cys He Pro Lys Arg Val Ala Phe Lys Gly Ser Ala Pro Leu 
805 810 815 

Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser 
820 825 830 

He Glu He Phe Phe Ser Asn His Cys Pro Leu Trp Tyr Gly Tyr Gly 
835 840 845 

Gly Gly Leu Lys Phe Leu Glu Arg Phe Ser Tyr He Asn Ser He Val 
850 855 860 

Tyr Pro Trp Thr Ser He Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala 
865 870 875 880 

He Cys Leu Leu Thr Gly Lys Phe He Thr Pro Glu Leu Asn Asn Val 
885 890 895 

Ala Ser Leu Trp Phe Met Ser Leu Phe He Cys He Phe Ala Thr Ser 
900 905 910 

He Leu Glu Met Arg Trp Ser Gly Val Gly He Asp Asp Trp Trp Arg 
915 920 925 

Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ser His Leu Phe Ala 
930 935 940 

Val Phe Gin Gly Leu Leu Lys Val He Ala Gly Val Asp Thr Ser Phe 
945 950 955 960 

Thr Val Thr Ser Lys Gly Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr 
965 970 975 

Thr Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu Leu 
980 985 990 

Leu Asn Phe He Gly Val Val Ala Gly Val Ser Asn Ala lie Asn Asn 
995 1000 1005 

Gly Tyr Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe 
1010 1015 " 1020 

Trp Val He Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg 
1025 1030 1035 1040 

Gin Asn Arg Thr Pro Thr He Val He Val Trp Ser He Leu Leu Ala 
1045 1050 1055 

Ser He Phe Ser Leu Leu Trp Val Arg He Asp Pro Phe Leu Ala Lys 
1060 1065 1070 

Asp Asp Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn 
1075 1080 1085 

<210> 11 

<211> 1138 

<212> DNA 

<213> Oryza sativa 
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<400> 11 

cgctgctccc 

tcgccacggc 

gtgacgtgct 

gcggcccggt 

aacctggagg 

agggggagct 

cgcgagtacg 

cggcgccgcg 

gggagcgtgc 

gacgtgtgcc 

caccggaagg 

gtcaacaact 

ggagccgccg 

gtcgatcccg 

ggcctcgacg 

gcgctctaca 

ctcggagaag 

aggtgaaaaa 

aaaatgaaat 



ggcgatggac 
gaacaccgtg 
atatttccga 
tcgcggcgct 
cgtacttcaa 
acagggggat 
aggagatgcg 
gcgcggccga 
cacagctcgg 
tcccggcgct 
cgggcgccat 
cgcaggccct 
aagacgccgg 
gcgaccgcta 
gcctccaggg 
gcgtcgacct 
acgagcggct 
ctcagctaaa 
ttactttttt 



gtgttcgtca 
ctgtccatat 
cgacgcaggc 
ttgggtgtcg 
cgccggcgag 
ggcgtggccg 
gctgcggatc 
tgaccacgcc 
cgttgcgaac 
tgtgtacgtg 
gaacgcgccc 
ccgcgccggc 
cgcggtcgcg 
cgccaaccac 
ccccatctac 
gccgcgctgg 
atggtccagg 
acctgaccca 
ttttttacca 



ccaccgccga 
atcctcgccg 
gcggaggtga 
ttctgccgga 
ggtggtggtg 
gagctggtgc 
gacgcgctgc 
ggagttgtgc 
gggagcaagc 
tgccgcgaga 
ttcatcctcg 
atctgcttca 
ttcgtccagt 
aaccgcgtcc 
gtcggcaccg 
agaccgcggc 
atgaaacaaa 
agctgtaaca 
aaaaaaaaaa 



ccccgacaag 
cgggctaccc 
cacgtaacgc 
agcacggcgt 
gcaaggcgaa 
gcgacaggag 
aggccgccga 
aggtactgat 
tcatcgacgt 
agcgccgcgg 
acctcgactg 
tgatcgaacg 
tcccgcagcg 
tcttcgactg 
gctgcttgtt 
gttcattggg 
tggtaatatt 
tgggtaaaaa 
aaaaaaaaaa 



gagccgccgc 
cgccggcaag 
ggtcgtggag 
cgagccgagg 
ggtggtggcg 
acgggtgcgc 
tgcgcgccgc 
cgattttgct 
cgcctctgtc 
ccacgcgcac 
cgactactac 
cggcggcggc 
ggtcgacggc 
caccgagctc 
ccgccgtgtc 
ctgtcgccta 
aagtggtcca 
tatatggccc 
aaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1138 



<210> 12 

<211> 341 

<212> PRT 

<213> Oryza sativa 



<400> 12 

Arg Cys Ser Arg Arg Trp Thr Cys Ser Ser Pro Pro Pro Thr Pro Thr 
15 io is 

Arg Ser Arg Arg Ser Pro Arg Arg Thr Pro Cys Cys Pro Tyr lie Leu 
20 25 30 

Ala Ala Gly Tyr Pro Ala Gly Lys Val Thr Cys Tyr He Ser Asp Asp 
35 40 45 

Ala Gly Ala Glu Val Thr Arg Asn Ala Val Val Glu Ala Ala Arg Phe 
50 55 60 

Ala Ala Leu Trp Val Ser Phe Cys Arg Lys His Gly Val Glu Pro Arq 
65 70 75 80 

Asn Leu Glu Ala Tyr Phe Asn Ala Gly Glu Gly Gly Gly Gly Lys Ala 
85 90 f 95 

Lys Val Val Ala Arg Gly Ser Tyr Arg Gly Met Ala Trp Pro Glu Leu 
100 105 no 

Val Arg Asp Arg Arg Arg Val Arg Arg Glu Tyr Glu Glu Met Arg Leu 
115 120 125 

Arg He Asp Ala Leu Gin Ala Ala Asp Ala Arg Arg Arg Arg Arg Gly 
130 135 140 

Ala Ala Asp Asp His Ala Gly Val Val Gin Val Leu He Asp Phe Ala 
145 150 155 160 

Gly Ser Val Pro Gin Leu Gly Val Ala Asn Gly Ser Lys Leu He Asp 
165 170 " 175 
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Val Ala Ser Val Asp Val Cys Leu Pro Ala Leu Val Tyr Val Cys Arg 
180 185 190 

Glu Lys Arg Arg Gly His Ala His His Arg Lys Ala Gly Ala Met Asn 
195 200 205 

Ala Pro Phe He Leu Asp Leu Asp Cys Asp Tyr Tyr Val Asn Asn Ser 
210 215 220 

Gin Ala Leu Arg Ala Gly He Cys Phe Met He Glu Arg Gly Gly Glv 
225 230 235 240 

Gly Ala Ala Glu Asp Ala Gly Ala Val Ala Phe Val Gin Phe Pro Gin 
245 250 255 

Arg Val Asp Gly Val Asp Pro Gly Asp Arg Tyr Ala Asn His Asn Arg 
260 265 270 

Val Leu Phe Asp Cys Thr Glu Leu Gly Leu Asp Gly Leu Gin Gly Pro 
275 280 285 

He Tyr Val Gly Thr Gly Cys Leu Phe Arg Arg Val Ala Leu Tyr Ser 
290 295 300 



Val Asp Leu Pro Arg Trp Arg Pro Arg Arg Ser Leu Gly Cys Arg Leu 
305 310 315 ' 320 

Leu Gly Glu Asp Glu Arg Leu Trp Ser Arg Met Lys Gin Met Val He 
325 330 335 

Leu Ser Gly Pro Arg 
340 



<210> 13 

<211> 3517 

<212> DNA 

<213> Glycine max 



<400> 13 

gcacgagcca acaacaacac ccttatgtgg 
caatcttcct tcataaaaca caaacctttg 
tgttgtgcca ttcccatatt gtcccattca 
cgctgggtca cataaccgca atgagctagt 
tttgaagaac ttggatgggc aagtgtgtga 
ggatggagac ttgtttgtgg cttgcaatga 
tgagtatgaa aggagagaag gaagccacct 
tctcaaaggg agcccccgag tggagggaga 
gcatgaattc aatattgatg agcaaaagaa 
tcatgggagg atgagctatg gaagaggtcc 
acctgtcatt gctggtggtc gttctaggcc 
tgcttatggg gatcagatgt tatcctcttc 
gtctgaacct ggaagtgcaa gatgggacga 
tggatgactg gaaattgcag caaggcaatt 
cagccatgtt agatgaagca aggcaaccac 
aaatcaatcc atatagaatg gtgattgtgg 
gatacagact catgaaccca gtacatgatg 
gtgaaatctg gtttgctttt tcatggattc 
atagagagac ctaccttgac cgtctttcca 
tgcttgctcc tgtagatgtt tttgttagta 
ttacagcaaa cactgttctt tcaatcttgg 



acacattagg tgaggttcaa cagctagcac 60 
atcacacaat ctcaccttaa tttgtgttgt 120 
ctaagacatg gaagccagcg ctggactggt 180 
tgtcattcat ggccatgaag agccgaaggc 240 
gatttgtggt gatggcgtgg gactcacggt 300 
gtgtggtttt ccagtgtgca ggccttgcta 360 
ttgcccacag tgcaaaacca gatacaagcg 420 
tgatgatgaa gaggatgtgg atgatattga 480 
caagcatggc caggttgcag aagccatgct 540 
tgaagatgat gacaattccc agttcccaac 600 
tgtaagtggg gagttcccaa tatcatctaa 660 
actgcataaa agagtgcatc catatccagt 720 
aaaaaaaaga agatggatgg aaagatagaa 780 
tggggcctga accggatgaa gatccagatg 840 
tgtcaaggaa agtgccaata gcatccagca 900 
cacgtctggt tattcttgct ttcttcctca 960 
ccctggggct atggctaacc tctatcatat 1020 
tggatcagtt tcccaaatgg tttcccattg 1080 
tcaggtatga gcgtgaaggt gaaccaaaca 1140 
ccgtggatcc catgaaggaa cctcctctgg 1200 
ccatggatta cccggttgat aaaatatcat 1260 
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gctacatttc tgatgatgga gcctcaatgt gtacatttga gcccttatca gaaactgcag 1320 
agtttgctag aaagtgggta ccgttttgta agaaattttc catagaacct cgggcacctg 1380 
agatgtactt cagcgagaag attgactacc taaaggacaa agtgcaaccc acctttgtta 1440 
aggagcgtcg agctatgaag agggaatacg aagagtttaa ggttaggatc aatgcacttg 1500 
ttgctaaggc ccagaaagtt cctcagggag gatggatcat gcaggatggg acaccatggc 1560 
cagggaataa cactaaggat catcctggta tgattcaagt gtttcttggt agcagtggag 1620 
gtcttgatac tgaaggaaac caacttcctc gccttgttta tgtttccaga gagaaaaggc 1680 
ctggttttca acaccacaag aaagctggtg ccatgaatgc tctggttcgg gtatctgctg 1740 
ttctcacaaa tgctcctttc atgttgaact tggattgtga tcactatgtc aataacagca 1800 
aggctgcccg agaggccatg tgcttcttga tggacccaca aactgggaag aaggtctgct 1860 
atgtccagtt tcctcaaaga tttgatggta ttgatacaca tgatcgttat gccaacagga 1920 
acacagtttt ctttgatatt aacatgaagg gtctagatgg tattcaaggt cctgtatatg 1980 
tggggactgg atgtgttttc aggaggcaag ctttgtatgg ctataatcct cccaagggtc 2040 
caaagcgtcc aaaaatggta agctgtgatt gttgcccgtg ttttggaagc cgcaagaagt 2100 
ataaggagaa gaatgatgca aatggagagg ctgcaagcct aaaagggatg gatgatgaca 2160 
aagaggtgtt gatgtcccaa atgaattttg agaagaaatt tggacaatcc tctatttttg 2220 
tgacttctac cttgatggaa gagggtggtg tgcctccttc ttcaagtcca gctgccctgc 2280 
ttaaagaagc cattcatgtg attagctgtg gatatgaaga taaaactgaa tggggacttg 2340 
agcttggttg gatctatgga tctatcacag aagatattct aacaggtttt aagatgcatt 2400 
gccgtgggtg gaggtccatt tattgtatgc caaagagagc tgcattcaag ggtactgctc 2460 
ctatcaactt gtcagatcgt ctcaaccagg ttcttcgttg ggcacttggt tccattgaga 2520 
ttttctttag tcaccattgc cctctatggt atggcttcaa ggaaaagaag ctaaagtggc 2580 
ttgagagatt tgcctatgca aacacaactg tctatccatt cacctccatt cctctagttg 2640 
cctactgtat tcttccagca gtttgtttac tcactgacaa attcatcatg ccaccgatta 2700 
gcacctttgc tggtttgtac tttgttgctc tcttctcctc aatcattgca actggtattc 2760 
ttgagttgaa atggagtgga gtgagcattg aggaatggtg gagaaatgag cagttttggg 2820 
tcattggtgg tgtatcagct cacctctttg ctgttataca aggtctgcta aaggttctgg 2880 
ctggaattga caccaatttc actgttacat caaaggcaac agatgatgaa gagtttggag 2940 
aattgtacac ctttaagtgg actacactct tgattcctcc aaccactatt ttgatcatta 3000 
acattgttgg tgttgttgct ggaatctcag atgccataaa caatgggtac caatcctggg 3060 
gaccactctt tggaaagctc ttcttttcct tctgggtgat tgtccatctc tatccattcc 3120 
ttaaaggttt gatgggtcgc caaaatcgca cacccaccat tgttgtgatt tggtcagtgc 3180 
tattggcctc tattttctcc ttactttggg taagaattga tccatttgtc ctcaagacta 3240 
agggacctga taccaagcta tgtggaatca actgctaaaa aagactgctt tccctatagt 3300 
attattcttt aaaagatgta tgtagggtac atacattctt ggtttcacaa accaacaaag 3360 
tggcaatgca caaggatcaa taaggaaaga gtgaaaattt tgtgtatcat aaatgagtgt 3420 
tatcattttt gtaaatgttc tcaaggacat ctgttttggt tggaactgcc caaaaattgc 3480 
agttttatct attcactgga aaaaaaaaaa aaaaaaa 3517 

<210> 14 

<211> 1039 

<212> PRT 

<213> Glycine max 

<220> 

<221> UNSURE 
<222> (201) 

<400> 14 

Met Glu Ala Ser Ala 
1 5 

Leu Val Val lie His 
20 

Asp Gly Gin Val Cys 
35 

Asp Gly Asp Leu Phe 
50 



Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
10 15 

Gly His Glu Glu Pro Lys Ala Leu Lys Asn Leu 
25 30 

Glu He Cys Gly Asp Gly Val Gly Leu Thr Val 
40 45 

Val Ala Cys Asn Glu Cys Gly Phe Pro Val Cys 
55 60 
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Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Ser His Leu Cys Pro 
65 70 75 80 

Gin Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Arg Val Glu 
85 90 95 

Gly Asp Asp Asp Glu Glu Asp Val Asp Asp He Glu His Glu Phe Asn 
100 105 no 

He Asp Glu Gin Lys Asn Lys His Gly Gin Val Ala Glu Ala Met Leu 
115 120 125 

His Gly Arg Met Ser Tyr Gly Arg Gly Pro Glu Asp Asp Asp Asn Ser 
130 135 140 

Gin Phe Pro Thr Pro Val He Ala Gly Gly Arg Ser Arg Pro Val Ser 
145 150 155 160 

Gly Glu Phe Pro He Ser Ser Asn Ala Tyr Gly Asp Gin Met Leu Ser 
165 170 175 

Ser Ser Leu His Lys Arg Val His Pro Tyr Pro Val Ser Glu Pro Gly 
180 185 190 

Ser Ala Arg Trp Asp Glu Lys Lys Xaa Asp Gly Trp Lys Asp Arg Met 
195 200 205 

Asp Asp Trp Lys Leu Gin Gin Gly Asn Leu Gly Pro Glu Pro Asp Glu 
210 215 220 

Asp Pro Asp Ala Ala Met Leu Asp Glu Ala Arg Gin Pro Leu Ser Arg 
225 230 235 240 

Lys Val Pro He Ala Ser Ser Lys He Asn Pro Tyr Arg Met Val He 
245 250 " 255 

Val Ala Arg Leu Val lie Leu Ala Phe Phe Leu Arg Tyr Arg Leu Met 
260 265 270 



Asn Pro Val His Asp Ala Leu Gly Leu Trp Leu Thr Ser He He Cys 
275 280 285 

Glu He Trp Phe Ala Phe Ser Trp He Leu Asp Gin Phe Pro Lys Trp 
290 295 300 

Phe Pro lie Asp Arg Glu Thr Tyr Leu Asp Arg Leu Ser He Arg Tyr 
305 310 315 320 

Glu Arg Glu Gly Glu Pro Asn Met Leu Ala Pro Val Asp Val Phe Val 
325 330 L 335 

Ser Thr Val Asp Pro Met Lys Glu Pro Pro Leu Val Thr Ala Asn Thr 
340 345 350 

Val Leu Ser lie Leu Ala Met Asp Tyr Pro Val Asp Lys lie Ser Cys 
355 360 365 

Tyr lie Ser Asp Asp Gly Ala Ser Met Cys Thr Phe Glu Ser Leu Ser 
370 375 380 
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Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lvs Phe 
385 390 395 400 

Ser He Glu Pro Arg Ala Pro Glu Met Tyr Phe Ser Glu Lys He Asp 
405 410 415 

Tyr Leu Lys Asp Lys Val Gin Pro Thr Phe Val Lys Glu Arg Arg Ala 
420 425 430 

Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Leu Val 
435 440 445 

Ala Lys Ala Gin Lys Val Pro Gin Gly Gly Trp He Met Gin Asp Gly 
450 455 460 

Thr Pro Trp Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met He Gin 
465 470 475 480 

Val Phe Leu Gly Ser Ser Gly Gly Leu Asp Thr Glu Gly Asn Gin Leu 
485 490 * 495 

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gin His 
500 505 510 

His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Ala Val 
515 520 525 

Leu Thr Asn Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr Val 
530 535 540 

Asn Asn Ser Lys Ala Ala Arg Glu Ala Met Cys Phe Leu Met Asp Pro 
54 5 550 555 560 

Gin Thr Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 
565 570 575 

Gly He Asp Thr His Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe Phe 
580 585 590 

Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro Val Tyr Val 
595 600 605 

Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Asn Pro 
610 615 620 

Pro Lys Gly Pro Lys Arg Pro Lys Met Val Ser Cys Asp Cys Cys Pro 
625 630 635 640 

Cys Phe Gly Ser Arg Lys Lys Tyr Lys Glu Lys Asn Asp Ala Asn Gly 
645 650 655 

Glu Ala Ala Ser Leu Lys Gly Met Asp Asp Asp Lys Glu Val Leu Met 
660 665 - 670 

Ser Gin Met Asn Phe Glu Lys Lys Phe Gly Gin Ser Ser He Phe Val 
675 680 685 

Thr Ser Thr Leu Met Glu Glu Gly Gly Val Pro Pro Ser Ser Ser Pro 
690 695 700 
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Ala Ala Leu Leu Lys Glu Ala He His Val He Ser Cys Gly Tyr Glu 
705 710 715 " 720 

Asp Lys Thr Glu Trp Gly Leu Glu Leu Gly Trp He Tyr Gly Ser He 
725 730 ^ 735 

Thr Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Arg 
740 745 750 

Ser He Tyr Cys Met Pro Lys Arg Ala Ala Phe Lys Gly Thr Ala Pro 
755 760 765 

He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly 
770 775 780 

Ser He Glu He Phe Phe Ser His His Cys Pro Leu Trp Tyr Gly Phe 
785 790 795 ** 800 

Lys Glu Lys Lys Leu Lys Trp Leu Glu Arg Phe Ala Tyr Ala Asn Thr 
805 810 ** 815 

Thr Val Tyr Pro Phe Thr Ser lie Pro Leu Val Ala Tyr Cys He Leu 
820 825 830 

Pro Ala Val Cys Leu Leu Thr Asp Lys Phe He Met Pro Pro lie Ser 
835 840 845 

Thr Phe Ala Gly Leu Tyr Phe Val Ala Leu Phe Ser Ser lie lie Ala 
850 855 860 

Thr Gly He Leu Glu Leu Lys Trp Ser Gly Val Ser He Glu Glu Trp 
865 870 875 880 

Trp Arg Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu 
885 890 895 

Phe Ala Val He Gin Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr 
900 905 910 

Asn Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Glu Glu Phe Gly Glu 
915 920 925 

Leu Tyr Thr Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr He 
930 935 940 

Leu lie lie Asn lie Val Gly Val Val Ala Gly He Ser Asp Ala lie 
945 950 955 960 

Asn Asn Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe 
965 970 ' 975 

Ser Phe Trp Val He Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met 
980 985 990 

Gly Arg Gin Asn Arg Thr Pro Thr lie Val Val lie Trp Ser Val Leu 
995 1000 1005 

Leu Ala Ser lie Phe Ser Leu Leu Trp Val Arg lie Asp Pro Phe Val 
1010 1015 1020 
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Leu Lys Thr Lys Gly Pro Asp Thr Lys Leu Cys Gly lie Asn Cys 
1025 1030 1035 

<210> 15 

<211> 2125 

<212> DNA 

<213> Glycine max 

<400> 15 

gccaaagctc agaagatgcc agaggaaggt tggacaatgc aggatggaac tccttggcct 60 
ggaaataatc ctagggatca tccgggaatg attcaggtgt ttttaggtca tagtgggggg 120 
ctggatacag atggaaatga gctgcctaga cttgtttatg tttctcgtga gaagcgacca 180 
ggcttccaac atcacaagaa ggctggagct atgaatgctt tgattcgagt ttctgctgtc 240 
ttgaccaatg gtgcatatct tctgaatgtg gattgtgatc actatttcaa taatagcaaa 300 
gccctcaaag aagccatgtg tttcatgatg gatcctgttc ttggaaagaa gacatgctat 360 
gttcaatttc ctcagagatt tgacggcatt gacttgcacg atcgatatgc caatcgcaat 420 
attgtgttct ttgatatcaa catgaaaggt caggatggtg ttcagggccc agtctatgtg 4 80 
ggaactggtt gttgtttcaa taggcaagct ttgtatggtt atgatcctgt tttgactgag 540 
gaagatttgg aacctaacat tattgtaaag agttgttgcg gttctagaaa gaagggaaag 600 
ggtggcaata agaagtacag tgacaagaag aaggcgatgg gaagaactga atccactgta 660 
cccatattta atatggaaga catagaggag ggtgttgaag gttatgatga tgaaaggaca 720 
ctacttatgt ctcaaaagag cttggagaag cgttttggtc agtctccagt ttttattgct 780 
gccactttca tggagcaggg tggcattcca ccttcaacga accctgcaac tcttcttaag 840 
gaagcaatcc atgttatcag ctgtggttac gaagacaaga cagaatgggg caaagagatt 900 
ggatggatct atggctctgt gacagaagat atcttgactg ggttcaagat gcatgctcgt 960 
ggttggattt ccatctattg catgccacct cgcccagcat ttaagggttc tgctcctatc 1020 
aatctttctg atcgtctcaa tcaggtgctt cggtgggcct tgggttcaat tgagatcttt 1080 
ctaagcaggc attgtccctt gtggtatggc tacaatggga agttgaagcc tctgatgagg 1140 
cttgcttata ttaacaccat tgtctacccg tttacctcaa tcccattgat tgcttactgt 1200 
acgcttcctg cattttgtct tctcacaaat aaatttatta ttcctgagat aagcaacttt 1260 
gccagtatgt ggttcattct tctctttgtc tccattttta ccacttcaat tcttgagctt 1320 
aggtggagtg gggtcagtat agaagactgg tggagaaatg aacagttctg ggttatcggt 1380 
gggacatctg cgcatctctt tgctgtgttc caggggcttc taaaagtgct tgctgggatc 1440 
gatacaaatt ttactgttac atcgaaggca tcggacgagg atggggactt tgccgagctt 1500 
tatgtgttra aatggacatc acttctcatc cctcctacaa cagtgcttat tgtgaatttg 1560 
gttgggattg tggctggtgt atcctatgcc ataaacagtg gttaccagtc ttggggtcca 1620 
ctatttggca agctgttctt tgctatctgg gtcattgccc atctataccc attcttgaag 1680 
ggtctcttgg gcaggcaaaa tcgtacccca accattgtta ttgtttggtc cgttcttctt 1740 
gcttcaatat tctccttgct gtgggtgagg attgatccct tcacctctga ctccaacaaa 1800 
ttaaccaatg gtcaatgtgg catcaactgt tagttctctt gtatgattca ttttgtgttg 1860 
ttattccctt ttgcttggag atacacaagg ttgctgtcgt gtatatagca agaattttca 1920 
gcctatcaaa gttgtctgga ggattgaacc cctgaaatag atgggaatgt accctctctg 1980 
tttctattat ttatctacat gttccttaca agaatagtca gtagtaatgt tgaggtgtat 2040 
gttatatttt ttccccacag aatataaatt tgttcatgcg aatatttaat gaaagccaac 2100 
aaggtcctgt gttgttttgt tcttt 2125 

<210> 16 

<211> 610 

<212> PRT 

<213> Glycine max 

<400> 16 

Ala Lys Ala Gin Lys Met Pro Glu Glu Gly Trp Thr Met Gin Asp Gly 
1 5 10 15 

Thr Pro Trp Pro Gly Asn Asn Pro Arg Asp His Pro Gly Met He Gin 
20 25 30 
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Val Phe Leu Gly His Ser Gly Gly Leu Asp Thr Asp Gly Asn Glu Leu 
35 40 45 

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gin His 
50 55 60 

His Lys Lys Ala Gly Ala Met Asn Ala Leu He Arg Val Ser Ala Val 
65 70 75 80 

Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val Asp Cys Asp His Tyr Phe 
85 90 95 

Asn Asn Ser Lys Ala Leu Lys Glu Ala Met Cys Phe Met Met Asp Pro 
100 105 no 

Val Leu Gly Lys Lys Thr Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 
U5 120 125 

Gly He Asp Leu His Asp Arg Tyr Ala Asn Arg Asn He Val Phe Phe 
130 135 140 

Asp He Asn Met Lys Gly Gin Asp Gly Val Gin Gly Pro Val Tyr Val 
1^5 150 155 160 

Gly Thr Gly Cys Cys Phe Asn Arg Gin Ala Leu Tyr Gly Tyr Asp Pro 
165 170 175 

Val Leu Thr Glu Glu Asp Leu Glu Pro Asn He He Val Lys Ser Cys 
180 185 190 

Cys Gly Ser Arg Lys Lys Gly Lys Gly Gly Asn Lys Lys Tyr Ser Asp 
195 200 205 

Lys Lys Lys Ala Met Gly Arg Thr Glu Ser Thr Val Pro He Phe Asn 
210 215 220 

Met Glu Asp He Glu Glu Gly Val Glu Gly Tyr Asp Asp Glu Arg Thr 
225 230 235 240 

Leu Leu Met Ser Gin Lys Ser Leu Glu Lys Arg Phe Gly Gin Ser Pro 
245 250 255 

Val Phe He Ala Ala Thr Phe Met Glu Gin Gly Gly He Pro Pro Ser 
260 265 " 270 

Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala He His Val He Ser Cys 
275 280 285 

Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys Glu He Gly Trp He Tyr 
290 295 300 



Gly Ser Val Thr Glu Asp He Leu Thr Gly Phe Lys Met His Ala Arg 
305 310 315 320 

Gly Trp He Ser He Tyr Cys Met Pro Pro Arg Pro Ala Phe Lys Gly 
325 330 335 

Ser Ala Pro He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp 
340 345 350 
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Ala Leu Gly Ser lie Glu He Phe Leu Ser Arg His Cys Pro Leu Trp 
355 360 365 

Tyr Gly Tyr Asn Gly Lys Leu Lys Pro Leu Met Arg Leu Ala Tyr He 
370 375 380 

Asn Thr He Val Tyr Pro Phe Thr Ser He Pro Leu He Ala Tyr Cys 
385 390 395 400 

Thr Leu Pro Ala Phe Cys Leu Leu Thr Asn Lys Phe lie lie Pro Glu 
405 410 415 

lie Ser Asn Phe Ala Ser Met Trp Phe lie Leu Leu Phe Val Ser lie 
420 425 430 

Phe Thr Thr Ser lie Leu Glu Leu Arg Trp Ser Gly Val Ser He Glu 
435 440 445 

Asp Trp Trp Arg Asn Glu Gin Phe Trp Val lie Gly Gly Thr Ser Ala 
450 455 460 

His Leu Phe Ala Val Phe Gin Gly Leu Leu Lys Val Leu Ala Gly lie 
465 470 475 480 

Asp Thr Asn Phe Thr Val Thr Ser Lys Ala Ser Asp Glu Asp Gly Asp 
485 490 495 

Phe Ala Glu Leu Tyr Val Phe Lys Trp Thr Ser Leu Leu He Pro Pro 
500 505 510 

Thr Thr Val Leu lie Val Asn Leu Val Gly lie Val Ala Gly Val Ser 
515 520 525 

Tyr Ala lie Asn Ser Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys 
530 535 540 

Leu Phe Phe Ala lie Trp Val lie Ala His Leu Tyr Pro Phe Leu Lys 
545 550 555 560 

Gly Leu Leu Gly Arg Gin Asn Arg Thr Pro Thr lie Val lie Val Trp 
565 570 575 

Ser Val Leu Leu Ala Ser lie Phe Ser Leu Leu Trp Val Arg lie Asp 
580 585 590 

Pro Phe Thr Ser Asp Ser Asn Lys Leu Thr Asn Gly Gin Cys Gly lie 
595 600 605 

Asn Cys 
610 

<210> 17 

<211> 2890 

<212> DNA 

<213> Glycine max 

<400> 17 

gcacgagctc cacccagtta atgatgcata tggcttgtgg ttgacatcag tcatctgtga 60 

aatatggttt gctgtatcat ggataatgga tcagtttcca aaatggtacc caatacagcg 120 

agaaacatac cttgatcgtc tgtcactcag gtatgaaaaa gaagggaagc catctgagtt 180 
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gtccagtgta gacgtctttg tcagtactgt tgatcccatg aaggaacctc cactgattac 240 
agcaaacact gttctatcta tccttgctgt tgattatcca gttgataaag ttgcatgcta 300 
tgtctcagat gatggtgctg ctatgcttac ttttgaagca ctgtctgaga catctgaatt 360 
tgctaggaga tgggttccat tttgtaagaa atacaatatt gagccccggg caccagaatg 420 
gtactttggt cagaagatgg actatctgaa aaataaagta cacccagcat ttgtcaggga 480 
aaggagagca atgaagaggg attatgaaga atttaaggtg aggattaaca gtttggtggc 540 
aacagcacaa aaggttcctg aggatggatg gaccatgcaa gatgggactc cttggcctgg 600 
aaataatgtg agggatcatc ctggcatgat tcaggtcttc cttgggcagg atggtgttcg 660 
tgatgttgaa ggaaatgagc taccccgctt ggtctacgtt tctagagaaa agaggccagg 720 
gtttgatcac cacaaaaagg ctggtgcaat gaatgctctg gtacgggctt cagcaattat 780 
cactaatgca ccctatcttc tgaatgttga ttgtgatcac tacattaaca atagcaaggc 840 
acttagagaa gctatgtgtt ttatgatgga tcctcaacta gggaaaaagg tttgctatgt 900 
gcaatttcct cagcgatttg atggaattga tagacatgat agatattcaa acagaaatgt 960 
tgtatttttc gatattaaca tgaaaggatt ggatgggata caaggtccaa tatatgtcgg 1020 
aactggatgt gttttcagaa ggtacgcact ttatggatat gatgcacctg ccaagaagaa 1080 
accaccgagc aaaacttgta actgttggcc aaagtggtgc tgcctatgtt gtggctctag 1140 
aaagaaaaag aatgccaata gtaagaagga gaaaaagagg aaggtgaagc acagtgaagc 1200 
atcaaagcag atacatgcac ttgaaaatat tgaggcgggg aatgaaggaa ccaacaatga 1260 
gaagacatcc aatctgactc aaacaaagtt ggagaagagg tttggacagt ctccagtatt 1320 
tgtagcctcc acacttttgg atgatggtgg agttccacat ggcgtgagtc ctgcatcact 1380 
tttaaaagaa gccatccagg tcatcagttg tggttatgaa gacaaaacag aatggggaaa 1440 
agaagttggg tggatatatg gttctgtgac agaggatatc ttgactggat ttaaaatgca 1500 
ttgccatggt tggcggtctg tgtattgcat tcctaagcgg cctgcattta aggggtctgc 1560 
gcctatcaac ctttcagatc gtctgcacca agttcttcgg tgggctcttg ggtctgttga 1620 
gatttttttc agcagacatt gtccaatctg gtatggctat ggtggtggat tgaaattgtt 1680 
ggaacgattt tcctacatta actcggtcgt atatccctgg acttccctcc cattgcttgt 1740 
ctactgtact ctaccagcca tatgccttct gactggaaaa tttatcgtac ccgagattag 1800 
caactatgcc agtcttgtgt tcatggccct cttcatatcc attgcagcaa ctggcatcct 1860 
tgagatgcaa tggggcggtg ttagcataga cgactggtgg aggaacgaac agttttgggt 1920 
gatcggaggt gtttcttccc atctatttgc cctatttcag ggtttactga aggtcttggc 1980 
tggtgtgaac acaaacttca ctgtgacctc aaaagcagca gatgatggag aattctcaga 2040 
actctacata ttcaagtgga catcactctt gatccctcca atgactttac ttatcatgaa 2100 
tattgtcggg gtggttgtcg ggatctcaga tgccatcaac aatggttatg actcatgggg 2160 
acctctcttt ggtagattgt tctttgcatt gtgggtgatc ctccatctct accccttctt 2220 
gaaggggttg cttggaaaac aagatagaat gccaaccatt atattggttt ggtcaatcct 2280 
tctggcctcc atcttgactc tcatgtgggt cagaattaac ccgtttgtgt caagagacgg 2340 
ccccgtgtta gaaatttgtg gattgaattg cgacgagtcg tgaataaaga aaagctgaag 2400 
aagaagggtt agttattttt cagctacact gcagtcatgt tgaagaatgc agccaacaca 24 60 
tgcttcacaa agttgcacga attttcggat ggaagtttta tttttcgggt gtgtagagat: 2520 
taaagagagg aaggggaggg ggctgacaca ttgttacctt gtaatagggt tttttcattt 2580 
attctttgat tatattttct gtgggtttta gtgttattct ctttccagtt tcatgtatta 2640 
taagaaagag gcattgaatg ataaattatt ccctcttcaa aatgggggat cctcagtctc 2700 
aagaaattac ttggtcatat ttttaggtat gggtcttgtt ctgtttaaaa ccatttgtaa 2760 
taccgtcaaa actatggata ttcttgttcc tcagatgtgt ttttgtgttt tattatttaa 2820 
cactcaggaa ccttttggtt tgattcaatt attcaatgtt tggatggcac taaaaaaaaa 2880 
aaaaaaaaaa 2890 

<210> 18 

<211> 793 

<212> PRT 

<213> Glycine max 

<400> 18 

His Glu Leu His Pro Val Asn Asp Ala Tyr Gly Leu Trp Leu Thr Ser 
1 5 10 15 

Val He Cys Glu He Trp Phe Ala Val Ser Trp He Met Asp Gin Phe 
20 25 30 
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Pro Lys Trp Tyr Pro He Gin Arg Glu Thr Tvr Leu Asp Arg Leu Ser 
35 40 45 

Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Glu Leu Ser Ser Val Asp 
50 55 60 

Val Phe Val Ser Thr Val Asp Pro Met Lys Glu Pro Pro Leu He Thr 
65 70 75 ~ 80 

Ala Asn Thr Val Leu Ser He Leu Ala Val Asp Tyr Pro Val Asp Lys 
85 90 95 

Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu 
100 105 HO 

Ala Leu Ser Glu Thr Ser Glu Phe Ala Arg Arg Trp Val Pro Phe Cys 
115 120 125 

Lys Lys Tyr Asn He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gly Gin 
130 135 140 

Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro Ala Phe Val Arg Glu 
145 150 155 160 

Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys Val Arg He Asn 
165 170 175 

Ser Leu Val Ala Thr Ala Gin Lys Val Pro Glu Asp Gly Trp Thr Met 
180 185 190 

Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly 
195 200 205 

Met He Gin Val Phe Leu Gly Gin Asp Gly Val Arg Asp Val Glu Gly 
210 215 220 

Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly 
225 230 235 240 

Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Ala 
245 250 255 

Ser Ala lie He Thr Asn Ala Pro Tyr Leu Leu Asn Val Asp Cys Asp 
260 265 270 

His Tyr He Asn Asn Ser Lys Ala Leu Arg Glu Ala Met Cys Phe Met 
275 280 285 

Met Asp Pro Gin Leu Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin 
290 295 300 



Arg Phe Asp Gly lie Asp Arg His Asp Arg Tyr Ser Asn Arg Asn Val 
305 310 315 320 

Val Phe Phe Asp lie Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro 
325 330 * 335 

He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Tyr Ala Leu Tyr Gly 
340 345 * 350 
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Tyr Asp Ala Pro Ala Lys Lys Lys Pro Pro Ser Lys Thr Cys Asn Cys 
355 360 365 

Trp Pro Lys Trp Cys Cys Leu Cys Cys Gly Ser Arg Lys Lys Lys Asn 
370 375 380 

Ala Asn Ser Lys Lys Glu Lys Lys Arg Lys Val Lys His Ser Glu Ala 
385 390 395 400 

Ser Lys Gin He His Ala Leu Glu Asn He Glu Ala Gly Asn Glu Gly 
405 410 415 

Thr Asn Asn Glu Lys Thr Ser Asn Leu Thr Gin Thr Lys Leu Glu Lys 
420 425 430 

Arg Phe Gly Gin Ser Pro Val Phe Val Ala Ser Thr Leu Leu Asp Asp 
435 440 445 

Gly Gly Val Pro His Gly Val Ser Pro Ala Ser Leu Leu Lys Glu Ala 
450 455 460 

He Gin Val He Ser Cys Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys 
465 470 475 480 

Glu Val Gly Trp lie Tyr Gly Ser Val Thr Glu Asp lie Leu Thr Gly 
485 490 495 

Phe Lys Met His Cys His Gly Trp Arg Ser Val Tyr Cys He Pro Lys 
500 505 510 

Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp Arg Leu 
515 520 525 

His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Phe Phe Ser 
530 535 540 

Arg His Cys Pro lie Trp Tyr Gly Tyr Gly Gly Gly Leu Lys Leu Leu 
545 550 555 560 

Glu Arg Phe Ser Tyr He Asn Ser Val Val Tyr Pro Trp Thr Ser Leu 
565 570 575 

Pro Leu Leu Val Tyr Cys Thr Leu Pro Ala He Cys Leu Leu Thr Gly 
580 585 590 

Lys Phe lie Val Pro Glu lie Ser Asn Tyr Ala Ser Leu Val Phe Met 
595 600 605 

Ala Leu Phe He Ser He Ala Ala Thr Gly lie Leu Glu Met Gin Trp 
610 615 " 620 

Gly Gly Val Ser lie Asp Asp Trp Trp Arg Asn Glu Gin Phe Trp Val 
625 630 635 640 

lie Gly Gly Val Ser Ser His Leu Phe Ala Leu Phe Gin Gly Leu Leu 
645 650 655 

Lys Val Leu Ala Gly Val Asn Thr Asn Phe Thr Val Thr Ser Lys Ala 
660 665 670 
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Ala Asp Asp Gly Glu Phe Ser Glu Leu Tyr lie Phe Lys Trp Thr Ser 
675 680 685 

Leu Leu He Pro Pro Met Thr Leu Leu He Met Asn He Val Gly Val 
690 695 700 

Val Val Gly He Ser Asp Ala He Asn Asn Gly Tyr Asp Ser Trp Gly 
705 710 715 ' 720 

Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val He Leu His Leu 
725 730 735 

Tyr Pro Phe Leu Lys Gly Leu Leu Gly Lys Gin Asp Arg Met Pro Thr 
7 40 745 750 

He He Leu Val Trp Ser He Leu Leu Ala Ser He Leu Thr Leu Met 
755 760 765 

Trp Val Arg He Asn Pro Phe Val Ser Arg Asp Gly Pro Val Leu Glu 
770 775 780 

He Cys Gly Leu Asn Cys Asp Glu Ser 
785 790 

<210> 19 

<211> 1742 

<212> DNA 

<213> Triticum aestivum 
<220> 

<221> unsure 

<222> (9) 

<220> 

<221> unsure 
<222> (271) 

<400> 19 

>wheatcong aagctatgtg cttcctaatg gatccaaacc taggtccgca agtctgttat 60 
gtgcagttcc cacaaaggtt tgatggtatt gataggaatg atcgatatgc aaacaggaac 120 
actgtctttt ttgatattaa cttgaggggc cttgacggca ttcaaggacc agtttatgtg 180 
ggaactggtt gtgttttcaa cagaacggct atctatggtt atgagccccc aattaaggcg 240 
aagaagccag gtttcttggc atcattatgt nggggcaaga agaaggcaag caagtcaaag 300 
aaaaggagct cagataagaa aaagtcgaac aagcatgtgg acagttctgt tccagtattc 360 
aatctcgaag acatagagga gggtgttgaa ggtgctgggt ttgatgatga gaaatcagtt 420 
ctcatgtctc aaatgagctt agagaagaga tttggccagt cagcagcatt tgttgcctcc 480 
actctgatgg aatatggtgg tgttcctcag tcgtccactc cagaatctct tttgaaagaa 540 
gctatccatg tcataagttg tggctatgag gacaagtctg aatggggaac tgagattggt 600 
tggatctatg gatctgtcac agaagatatt ctaactggat tcaagatgca cgcaagaggc 660 
tggcgttcaa tctattgcat gcccaagcgc ccagctttca agggatctgc ccccatcaat 720 
ctttcagatc gtctgaatca agtgctgcgg tgggctcttg gttctgttga aattcttttc 780 
agccggcatt gccccttatg gtatggctac ggagggcgcc tcaagttcct ggagagattc 840 
gcttacatca acaccaccat ttacccacta acctctctcc cgcttctagt ctattgtata 900 
ttgcctgcta tctgtctgct cactggaaag ttcatcatgc cagagattag caacttggcc 960 
agtatctggt tcattgcgct cttcctttca attttcgcca ctggtatcct tgagatgagg 1020 
tggagtggtg ttggcattga cgagtggtgg aggaatgaac agttctgggt cattggaggt 1080 
atctctgccc atctgtttgc cgtctttcag ggtcttctga aggtgcttgc aggtatcgac 1140 
accaacttca ctgtcacctc aaaggctaat gatgaagaag gcgactttgc tgagctctac 1200 
atgttcaagt ggacgacgct tcttatccct ccgacgacca ttttgatcat taacatggtc 1260 
ggtgtcgttg ctggtacctc cxacgccatc aacagtggtt accaatcatg ggggccgctc 1320 
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tttgggaagc tcttctttgc cttctgggtg attgttcact tatacccatt cctcaagggt 1380 

cttatgggca ggcaaaaccg cacaccgacg attgtcatcg tctgggctgt cctcctcgct 1440 

tctatcttct ccttgctgtg ggttcgtgtt gatccattca ctacccgtct cgctggccca 1500 

aatatccaaa cctgtggcat caactgctag gaaagtggga gtttgtagag acagaaaata 1560 

taacagtgat cgagcgacca cctgtggagc cagagaatat ttatgttggg gttgtgaatt 1620 

actacgtttg agaaagttgt caaaattgag aaaacacatt tgtaaataga tgtaatagac 1680 

tatctaccgt tttcatgagg ttaagctctt cttttttgga aaaaaaaaaa aaaaaaaaaa 1740 

aa 1742 

<210> 20 

<211> 506 

<212> PRT 

<213> Triticum aestivum 
<220> 

<221> UNSURE 

<222> (88) 

<400> 20 

Glu Ala Met Cys Phe Leu Met Asp Pro Asn Leu Gly Pro Gin Val Cys 
1 5 10 15 

Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly lie Asp Arg Asn Asp Arg 
20 25 30 

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp lie Asn Leu Arg Gly Leu 
35 40 45 

Asp Gly He Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn 
50 55 60 

Arg Thr Ala He Tyr Gly Tyr Glu Pro Pro He Lys Ala Lys Lys Pro 
65 70 75 ~ " 80 

Gly Phe Leu Ala Ser Leu Cys Xaa Gly Lys Lys Lys Ala Ser Lys Ser 
85 90 95 

Lys Lys Arg Ser Ser Asp Lys Lys Lys Ser Asn Lys His Val Asp Ser 
100 105 no 

Ser Val Pro Val Phe Asn Leu Glu Asp He Glu Glu Gly Val Glu Gly 
115 120 125 

Ala Gly Phe Asp Asp Glu Lys Ser Val Leu Met Ser Gin Met Ser Leu 
130 135 i 4 o 

Glu Lys Arg Phe Gly Gin Ser Ala Ala Phe Val Ala Ser Thr Leu Met 
I 45 150 155 160 

Glu Tyr Gly Gly Val Pro Gin Ser Ser Thr Pro Glu Ser Leu Leu Lys 
165 170 175 

Glu Ala He His Val He Ser Cys Gly Tyr Glu Asp Lys Ser Glu Trp 
180 185 190 

Gly Thr Glu He Gly Trp He Tyr Gly Ser Val Thr Glu Asp He Leu 
195 200 205 

Thr Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser He Tyr Cys Met 
210 215 220 
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Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro lie Asn Leu Ser Asp 
225 230 235 240 

Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Leu 
245 250 255 

Phe Ser Arg His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Arg Leu Lys 
260 265 270 

Phe Leu Glu Arg Phe Ala Tyr He Asn Thr Thr He Tyr Pro Leu Thr 
275 280 285 

Ser Leu Pro Leu Leu Val Tyr Cys He Leu Pro Ala He Cys Leu Leu 
290 295 300 

Thr Gly Lys Phe He Met Pro Glu He Ser Asn Leu Ala Ser He Trp 
305 310 315 320 

Phe He Ala Leu Phe Leu Ser He Phe Ala Thr Gly He Leu Glu Met 
325 330 * 335 

Arg Trp Ser Gly Val Gly He Asp Glu Trp Trp Arg Asn Glu Gin Phe 
340 345 350 

Trp Val He Gly Gly He Ser Ala His Leu Phe Ala Val Phe Gin Gly 
355 360 365 

Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr Val Thr Ser 
370 375 380 

Lys Ala Asn Asp Glu Glu Gly Asp Phe Ala Glu Leu Tyr Met Phe Lys 
385 390 395 400 

Trp Thr Thr Leu Leu He Pro Pro Thr Thr He Leu He He Asn Met 
405 410 415 

Val Gly Val Val Ala Gly Thr Ser Tyr Ala He Asn Ser Gly Tyr Gin 
420 425 430 

Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val He 
435 440 445 

Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg Gin Asn Arg 
450 455 460 

Thr Pro Thr He Val He Val Trp Ala Val Leu Leu Ala Ser He Phe 
465 470 475 480 

Ser Leu Leu Trp Val Arg Val Asp Pro Phe Thr Thr Arg Leu Ala Gly 
485 490 495 

Pro Asn He Gin Thr Cys Gly He Asn Cys 



500 



505 



<210> 
<211> 
<212> 
<213> 



21 

1029 
DMA 

Triticura aestivum 
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<400> 21 

gcacgagccg ctcctcacca acggccagat ggttgatgac atcccgccgg agcagcacgc 60 

gctcgtgccg tcctacatga gcggcggcgg cggcgggggc aagaggatcc acccgctccc 120 

tttcgcagat cccaaccttc cagtgcaacc gagatccatg gacccgtcca aggatctggc 180 

cgcctacgga tatggcagcg tggcctggaa ggagagaatg gagggctgga agcagaagca 240 

ggagcgcctg cagcatgtca ggagcgaggg tggcggtgat tgggatggcg acgatgcaga 300 

tctgccacta atggatgaag ctaggcagcc attgtccaga aaagtcccta tatcatcaag 360 

ccgaattaat ccctacagga tgattatcgt tatccggttg gtggttttgg gtttcttctt 420 

ccactaccga gtgatgcatc cggcgaaaga tgcatttgca ttgtggctca tatctgtaat 480 

ctgtgaaatc tggtttgcga tgtcctgtat tcttgatcag ttcccaaagt ggtttccaat 540 

cgagagagag acttacctgg accgtttgtc actaaggttt gacaaggaag gtcaaccctc 600 

tcagcttgct ccaatcgact tctttgtcag tacggttgat cccacaaagg aacctccctt 660 

ggtcacagcg aacactgtcc tttccatcct ttctgtggat tatccggttg agaaggtctc 720 

ctgctatgtt tctgatgatg gtgctgcaat gcttacgttt gaagcattgt ctgaaacatc 780 

tgaatttgca aagaaatggg ttcctttcag caaaaagttt aatatcgagc ctcgtgctcc 840 

tgagtggtac ttccaacaga agatagacta cctgaaagac aaggttgctg cttcatttgt 900 

tagggagagg agggcgatga agagagaata cgaggaattc aaggtaagga tcaatgcctt 960 

ggttgcaaaa gcccaaaagg ttcctgagga aggatggaca atgcaagatg gaagcccctg 1020 

gcctggaaa 1029 

<210> 22 
<211> 340 
<212> PRT 

<213> Triticum aestivum 
<400> 22 

Pro Leu Leu Thr Asn Gly Gin Met Val Asp Asp lie Pro Pro Glu Gin 
15 io is 

His Ala Leu Val Pro Ser Tyr Met Ser Gly Gly Gly Gly Gly Gly Lys 
20 25 ' 30 

Arg He His Pro Leu Pro Phe Ala Asp Pro Asn Leu Pro Val Gin Pro 
35 40 45 

Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr Gly Tyr Glv Ser 
50 55 60 

Val Ala Trp Lys Glu Arg Met Glu Gly Trp Lys Gin Lys Gin Glu Arg 
65 70 75 J 80 

Leu Gin His Val Arg Ser Glu Gly Gly Gly Asp Trp Asp Gly Asp Asp 
85 90 95 

Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin Pro Leu Ser Arg Lys 
100 105 no 

Val Pro He Ser Ser Ser Arg He Asn Pro Tyr Arg Met He He Val 
115 120 125 

He Arg Leu Val Val Leu Gly Phe Phe Phe His Tyr Arg Val Met His 
130 135 140 

Pro Ala Lys Asp Ala Phe Ala Leu Trp Leu He Ser Val He Cys Glu 

150 155 160 

He Trp Phe Ala Met Ser Cys He Leu Asp Gin Phe Pro Lys Trp Phe 
165 170 175 
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Pro lie Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Phe Asp 
180 185 190 

Lys Glu Gly Gin Pro Ser Gin Leu Ala Pro lie Asp Phe Phe Val Ser 
195 200 205 

Thr Val Asp Pro Thr Lys Glu Pro Pro Leu Val Thr Ala Asn Thr Val 
210 215 220 

Leu Ser He Leu Ser Val Asp Tyr Pro Val Glu Lys Val Ser Cys Tyr 
225 230 235 240 

Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu Ala Leu Ser Glu 
245 250 255 

Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Ser Lys Lys Phe Asn 
260 265 270 

He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin Gin Lys He Asp Tyr 
275 280 285 

Leu Lys Asp Lys Val Ala Ala Ser Phe Val Arg Glu Arg Arg Ala Met 
290 295 300 

Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Leu Val Ala 
305 310 315 320 

Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr Met Gin Asp Gly Ser 
325 330 335 

Pro Trp Pro Gly 
340 

<210> 23 
<211> 1081 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 23 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser Tyr Arg Arg Asn Glu 
1 5 io 15 

Leu Val Arg He Arg His Glu Ser Asp Gly Gly Thr Lys Pro Leu Lys 
20 25 - 30 

Asn Met Asn Gly Gin He Cys Gin He Cys Gly Asp Asp Val Gly Leu 
35 40 45 

Ala Glu Thr Gly Asp Val Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 
50 55 60 

Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Thr Gin Cys 
65 70 75 80 

Cys Pro Gin Cys Lys Thr Arg Phe Arg Arg His Arg Gly Ser Pro Arg 
85 90 * 95 

Val Glu Gly Asp Glu Asp Glu Asp Asp Val Asp Asp He Glu Asn Glu 
100 105 no 
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Phe Asn Tyr Ala Gin Gly Ala Asn Lys Ala Arg His Gin Arg His Gly 
115 120 125 

Glu Glu Phe Ser Ser Ser Ser Arg His Glu Ser Gin Pro lie Pro Leu 
130 135 140 

Leu Thr His Gly His Thr Val Ser Gly Glu He Arg Thr Pro Asp Thr 
145 150 155 160 

Gin Ser Val Arg Thr Thr Ser Gly Pro Leu Gly Pro Ser Asp Arg Asn 
165 170 175 

Ala He Ser Ser Pro Tyr He Asp Pro Arg Gin Pro Val Pro Val Arg 
180 185 190 

He Val Asp Pro Ser Lys Asp Leu Asn Ser Tyr Gly Leu Gly Asn Val 
195 200 205 

Asp Trp Lys Glu Arg Val Glu Gly Trp Lys Leu Lys Gin Glu Lys Asn 
210 215 220 

Met Leu Gin Met Thr Gly Lys Tyr His Glu Gly Lys Gly Gly Glu He 
225 230 235 240 

Glu Gly Thr Gly Ser Asn Gly Glu Glu Leu Gin Met Ala Asp Asp Thr 
245 250 255 

Arg Leu Pro Met Ser Arg Val Val Pro lie Pro Ser Ser Arg Leu Thr 
260 265 270 

Pro Tyr Arg Val Val He He Leu Arg Leu He He Leu Cys Phe Phe 
275 280 285 

Leu Gin Tyr Arg Thr Thr His Pro Val Lys Asn Ala Tyr Pro Leu Trp 
290 295 300 

Leu Thr Ser Val He Cys Glu He Trp Phe Ala Phe Ser Trp Leu Leu 
305 310 315 320 

Asp Gin Phe Pro Lys Trp Tyr Pro He Asn Arg Glu Thr Tyr Leu Asp 
325 330 335 

Arg Leu Ala He Arg Tyr Asp Arg Asp Gly Glu Pro Ser Gin Leu Val 
340 345 350 

Pro Val Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro 
355 360 365 

Leu Val Thr Ala Asn Thr Val Leu Ser He Leu Ser Val Asp Tyr Pro 
370 375 380 

Val Asp Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ser Ala Met Leu 
385 390 395 400 

Thr Phe Glu Ser Leu Ser Glu Thr Ala Glu Phe Ala Lys Lys Trp Val 
405 410 * 415 



Pro Phe Cys Lys Lys Phe Asn He Glu Pro Arg Ala Pro Glu Phe Tyr 
420 425 430 



36 



WO 00/04166 



PCT/US99/15871 



Phe Ala Gin Lys He Asp Tyr Leu Lys Asp Lys He Gin Pro Ser Phe 
435 440 445 

Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val 
450 455 460 

Arg He Asn Ala Leu Val Ala Lys Ala Gin Lys He Pro Glu Glu Gly 
465 470 475 480 

Trp Thr Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp 
485 490 495 

His Pro Gly Met He Gin Val Phe Leu Gly His Ser Gly Gly Leu Asp 
500 505 510 

Thr Asp Gly Asn Glu Leu Pro Arg Leu He Tyr Val Ser Arg Glu Lys 
515 520 525 

Arg Pro Gly Phe Gin His His Lys Lys Ala Gly Ala Met Asn Ala Leu 
530 535 540 

lie Arg Val Ser Ala Val Leu Thr Asn Gly Ala Tvr Leu Leu Asn Val 
545 550 555 " 560 

Asp Cys Asp His Tyr Phe Asn Asn Ser Lys Ala He Lys Glu Ala Met 
565 570 575 

Cys Phe Met Met Asp Pro Ala He Gly Lys Lys Cys Cys Tyr Val Gin 
580 585 ~ 590 

Phe Pro Gin Arg Phe Asp Gly He Asp Leu His Asp Arg Tyr Ala Asn 
595 600 605 

Arg Asn He Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly He 
610 615 620 

Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Cys Phe Asn Arg Gin Ala 
625 630 635 640 

Leu Tyr Gly Tyr Asp Pro Val Leu Thr Glu Glu Asp Leu Glu Pro Asn 
645 650 655 

He He Val Lys Ser Cys Cys Gly Ser Arg Lys Lys Gly Lys Ser Ser 
660 665 670 

Lys Lys Tyr Asn Tyr Glu Lys Arg Arg Gly He Asn Arg Ser Asp Ser 
675 680 685 

Asn Ala Pro Leu Phe Asn Met Glu Asp He Asp Glu Gly Phe Glu Gly 
690 695 700 

Tyr Asp Asp Glu Arg Ser He Leu Met Ser Gin Arg Ser Val Glu Lys 
705 710 715 720 

Arg Phe Gly Gin Ser Pro Val Phe He Ala Ala Thr Phe Met Glu Gin 
725 730 735 

Gly Gly He Pro Pro Thr Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala 
740 745 750 
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lie His Val He Ser Cys Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys 
755 760 765 

Glu He Gly Trp lie Tyr Gly Ser Val Thr Glu Asp He Leu Thr Gly 
770 775 780 

Phe Lys Met His Ala Arg Gly Trp He Ser He Tyr Cys Asn Pro Pro 
785 790 795 800 

Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp Arg Leu 
805 810 815 

Asn Gin Val Leu Arg Trp Ala Leu Gly Ser He Glu lie Leu Leu Ser 
820 825 830 

Arg His Cys Pro He Trp Tyr Gly Tyr His Gly Arg Leu Arg Leu Leu 
835 840 845 

Glu Arg He Ala Tyr He Asn Thr He Val Tyr Pro He Thr Ser He 
850 855 860 

Pro Leu He Ala Tyr Cys lie Leu Pro Ala Phe Cys Leu lie Thr Asp 
865 870 875 880 

Arg Phe lie lie Pro Glu lie Ser Asn Tyr Ala Ser lie Trp Phe lie 
885 890 895 

Leu Leu Phe lie Ser lie Ala Val Thr Gly lie Leu Glu Leu Arg Trp 
900 905 910 

Ser Gly Val Ser lie Glu Asp Trp Trp Arg Asn Glu Gin Phe Trp Val 
915 920 925 

lie Gly Gly Thr Ser Ala His Leu Phe Ala Val Phe Gin Gly Leu Leu 
930 935 940 

Lys Val Leu Ala Gly lie Asp Thr Asn Phe Thr Val Thr Ser Lys Ala 
945 950 955 960 

Thr Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr He Phe Lys Trp Thr 
965 970 975 

Ala Leu Leu lie Pro Pro Thr Thr Val Leu Leu Val Asn Leu lie Gly 
980 985 990 

lie Val Ala Gly Val Ser Tyr Ala Val Asn Ser Gly Tyr Gin Ser Trp 
995 1000 1005 

Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Leu Trp Val He Ala His 
1010 1015 1020 

Leu Tyr Pro Phe Leu Lys Gly Leu Leu Gly Arg Gin Asn Arg Thr Pro 
1025 1030 1035 1040 

Thr lie Val lie Val Trp Ser Val Leu Leu Ala Ser lie Phe Ser Leu 
1045 1050 1055 

Leu Trp Val Arg lie Asn Pro Phe Val Asp Ala Asn Pro Asn Ala Asn 
1060 1065 1070 
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Asn Phe Asn Gly Lys Gly Gly Val Phe 

1080 





1075 


<210> 


24 


<211> 


1084 


<212> 


PRT 


<213> 


Arabidopsis thaliana 


<400> 


24 



Met Asn Thr Gly Gly Arg Leu lie Ala Gly Ser His Asn Arg Asn Glu 
15 io 15 

Phe Val Leu He Asn Ala Asp Glu Ser Ala Arg He Arg Ser Val Gin 
20 25 30 

Glu Leu Ser Gly Gin Thr Cys Gin He Cys Gly Asp Glu He Glu Leu 
35 40 45 

Thr Val Ser Ser Glu Leu Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 
50 55 60 

Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Asn Gin Ala 
65 70 75 * 80 

Cys Pro Gin Cys Lys Thr Arg Tyr Lys Arg He Lys Gly Ser Pro Arg 
85 90 * 95 

Val Asp Gly Asp Asp Glu Glu Glu Glu Asp lie Asp Asp Leu Glu Tyr 
100 105 * HO 

Glu Phe Asp His Gly Met Asp Pro Glu His Ala Ala Glu Ala Ala Leu 
115 120 125 

Ser Ser Arg Leu Asn Thr Gly Arg Gly Gly Leu Asp Ser Ala Pro Pro 
130 135 140 

Gly Ser Gin He Pro Leu Leu Thr Tyr Cys Asp Glu Asp Ala Asp Met 
145 150 155 i 60 

Tyr Ser Asp Arg His Ala Leu He Val Pro Pro Ser Thr Gly Tyr Gly 
165 170 175 

Asn Arg Val Tyr Pro Ala Pro Phe Thr Asp Ser Ser Ala Pro Pro Gin 
180 185 190 

Ala Arg Ser Met Val Pro Gin Lys Asp He Ala Glu Tyr Gly Tyr Gly 
195 200 205 

Ser Val Ala Trp Lys Asp Arg Met Glu Val Trp Lys Arg Arg Gin Gly 
210 215 220 

Glu Lys Leu Gin Val He Lys His Glu Gly Gly Asn Asn Gly Arg Gly 
225 230 235 * 240 

Ser Asn Asp Asp Asp Glu Leu Asp Asp Pro Asp Met Pro Met Met Asp 
245 250 255 

Glu Gly Arg Gin Pro Leu Ser Arg Lys Leu Pro He Arg Ser Ser Arg 
260 265 270 
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lie Asn Pro Tyr Arg Met Leu lie Leu Cys Arg Leu Ala He Leu Glv 
275 280 285 

Leu Phe Phe His Tyr Arg He Leu His Pro Vai Asn Asp Ala Tvr Glv 
290 295 300 

Leu Trp Leu Thr Ser Val He Cys Glu He Trp Phe Ala Val Ser Trp 
305 310 315 320 

He Leu Asp Gin Phe Pro Lys Trp Tyr Pro He Glu Arg Glu Thr Tyr 
325 330 - 335 

Leu Asp Arg Leu Ser Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Gly 
340 345 350 

Leu Ala Pro Val Asp Val Phe Val Ser- Thr Val Asp Pro Leu Lys Glu 
355 360 365 

Pro Pro Leu He Thr Ala Asn Thr Val Leu Ser He Leu Ala Val Asp 
370 375 380 

Tyr Pro Val Asp Lys Val Ala Cys Tyr Val Ser Asp Asd Gly Ala Ala 
385 390 395 * 400 

Met Leu Thr Phe Glu Ala Leu Ser Asp Thr Ala Glu Phe Ala Arg Lys 
405 410 415 

Trp Val Pro Phe Cys Lys Lys Phe Asn He Glu Pro Arg Ala Pro Glu 
420 425 430 

Trp Tyr Phe Ser Gin Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro 
435 440 445 

Ala Phe Val Arg Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe 
450 455 460 

Lys Val Lys He Asn Ala Leu Val Ala Thr Ala Gin Lys Val Pro Glu 
465 470 475 480 

Glu Gly Trp Thr Met Gin Asp Gly Thr Pro Tro Pro Gly Asn Asn Val 
485 490 495 

Arg Asp His Pro Gly Met He Gin Val Phe Leu Gly His Ser Gly Val 
500 505 ' 510 

Arg Asp Thr Asp Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arq 
515 520 525 

Glu Lys Arg Pro Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn 
530 - 535 540 



Ser Leu He Arg Val Ser Ala Val Leu Ser Asn Ala Pro Tyr Leu Leu 

545 550 555 - 560 

Asn Val Asp Cys Asp His Tyr He Asn Asn Ser Lys Ala He Arg Glu 

565 570 - 575 

Ser Met Cys Phe Met Met Asp Pro Gin Ser Gly Lys Lys Val Cys Tyr 

580 585 590 
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Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg His Asp Arg Tyr 
595 600 605 

Ser Asn Arg Asn Val Val Phe Phe Asp He Asn Met Lys Gly Leu Asp 
610 615 620 

Gly He Gin Gly Pro He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg 
625 630 635 640 

Gin Ala Leu Tyr Gly Phe Asp Ala Pro Lys Lys Lys Lys Pro Pro Gly 
645 650 655 

Lys Thr Cys Asn Cys Trp Pro Lys Trp Cys Cys Leu Cys Cys Gly Leu 
660 665 670 

Arg Lys Lys Ser Lys Thr Lys Ala Lys Asp Lys Lys Thr Asn Thr Lys 
675 680 685 

Glu Thr Ser Lys Gin He His Ala Leu Glu Asn Val Asp Glu Gly Val 
690 695 700 

He Val Pro Val Ser Asn Val Glu Lys Arg Ser Glu Ala Thr Gin Leu 
705 710 715 720 

Lys Leu Glu Lys Lys Phe Gly Gin Ser Pro Val Phe Val Ala Ser Ala 
725 730 735 

Val Leu Gin Asn Gly Gly Val Pro Arg Asn Ala Ser Pro Ala Cys Leu 
740 745 750 

Leu Arg Glu Ala He Gin Val He Ser Cys Gly Tyr Glu Asp Lys Thr 
755 760 765 

Glu Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr Glu Asp 
770 775 780 

He Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser Val Tyr 
785 790 795 * " 800 

Cys Met Pro Lys Arg Ala Ala Phe Lys Gly Ser Ala Pro He Asn Leu 
805 810 815 

Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu 
820 825 830 

He Phe Leu Ser Arg His Cys Pro He Trp Tyr Gly Tyr Gly Gly Gly 
835 840 845 

Leu Lys Trp Leu Glu Arg Phe Ser Tyr He Asn Ser Val Val Tyr Pro 
850 855 860 

Trp Thr Ser Leu Pro Leu He Val Tyr Cys Ser Leu Pro Ala Val Cys 
865 870 875 880 

Leu Leu Thr Gly Lys Phe He Val Pro Glu He Ser Asn Tyr Ala Gly 
885 890 895 



He Leu Phe Met Leu Met Phe He Ser lie Ala Val Thr Gly lie Leu 
900 905 910 
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Glu Met Gin Trp Gly Gly Val Gly lie Asp Asp Trp TrD Arg Asn Glu 
915 920 925 

Gin Phe Trp Val lie Gly Gly Ala Ser Ser His Leu Phe Ala Leu Phe 
930 935 940 

Gin Gly Leu Leu Lys Val Leu Ala Gly Val Asn Thr Asn Phe Thr Val 
945 950 955 960 

Thr Ser Lys Ala Ala Asp Asp Gly Ala Phe Ser Glu Leu Tyr He Phe 
965 970 975 

Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu He He Asn 
980 985 990 

He lie Gly Val He Val Gly Val Ser- Asp Ala He Ser Asn Gly Tyr 
995 1000 1005 

Asp Ser Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val 
1010 1015 1020 

He Val His Leu Tyr Pro Phe Leu Lys Gly Met Leu Gly Lys Gin Asp 
1025 1030 1035 * - 1040 

Lys Met Pro Thr He He Val Val Trp Ser He Leu Leu Ala Ser He 
1045 1050 1055 

Leu Thr Leu Leu Trp Val Arg Val Asn Pro Phe Val Ala Lys Gly Gly 
1060 1065 1070 

Pro Val Leu Glu He Cys Gly Leu Asn Cys Gly Asn 
1075 1080 

<210> 25 
<211> 685 
<212> PRT 

<213> Gossypium hirsutum 
<400> 25 

Arg Arg Trp Val Pro Phe Cys Lys Lys His Asn Val Glu Pro Arg Ala 
15 10 is 

Pro Glu Phe Tyr Phe Asn Glu Lys He Asp Tyr Leu Lys Asp Lys Val 
20 25 30 

His Pro Ser Phe Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu 
35 40 45 

Glu Phe Lys Val Arg He Asn Ala Leu Val Ala Lys Ala Gin Lys Lys 
50 55 60 

Pro Glu Glu Gly Trp Val Met Gin Asp Gly Thr Pro Trp Pro Gly Asn 
65 70 75 80 

Asn Thr Arg Asp His Pro Gly Met He Gin Val Tyr Leu Gly Ser Ala 
85 90 95 

Gly Ala Leu Asp Val Asp Gly Lys Glu Leu Pro Arg Leu Val Tyr Val 
100 105 no 
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Ser Arg Glu Lys Arg Pro Gly Tyr Gin His His Lys Lys Ala Gly Ala 
115 120 125 

Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Ala Pro Phe 
130 135 140 

He Leu Asn Leu Asp Cys Asp His Tyr He Asn Asn Ser Lys Ala Met 
145 150 155 160 

Arg Glu Ala Met Cys Phe Leu Met Asp Pro Gin Phe Gly Lys Lys Leu 
165 170 175 

Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg His Asp 
180 185 190 

Arg Tyr Ala Asn Arg Asn Val Val Phe Phe Asp He Asn Met Leu Gly 
195 200 205 

Leu Asp Gly Leu Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe 
210 215 220 

Asn Arg Gin Ala Leu Tyr Gly Tyr Asp Pro Pro Val Ser Glu Lys Arq 
225 230 235 240 

Pro Lys Met Thr Cys Asp Cys Trp Pro Ser Trp Cys Cys Cys Cys Cys 
245 250 ' 255 

Gly Gly Ser Arg Lys Lys Ser Lys Lys Lys Gly Glu Lys Lys Gly Leu 
260 265 270 

Leu Gly Gly Leu Leu Tyr Gly Lys Lys Lys Lys Met Met Gly Lys Asn 
275 280 285 

Tyr Val Lys Lys Gly Ser Ala Pro Val Phe Asp Leu Glu Glu He Glu 
290 295 300 

Glu Gly Leu Glu Gly Tyr Glu Glu Leu Glu Lys Ser Thr Leu Met Ser 
305 310 315 320 

Gin Lys Asn Phe Glu Lys Arg Phe Gly Gin Ser Pro Val Phe He Ala 
325 330 335 

Ser Thr Leu Met Glu Asn Gly Gly Leu Pro Glu Gly Thr Asn Ser Thr 
340 345 350 

Ser Leu lie Lys Glu Ala He His Val He Ser Cys Gly Tyr Glu Glu 
355 360 365 

Lys Thr Glu Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr 
370 375 380 



Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Lys Ser 
385 390 395 400 

Val Tyr Cys Val Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro He 
405 410 415 

Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser 
420 425 430 
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Val Glu He Phe Leu Ser Arg His Cys Pro Leu Trp Tyr Gly Tyr Gly 
435 440 445 

Gly Lys Leu Lys Trp Leu Glu Arg Leu Ala Tyr He Asn Thr He Val 
450 455 460 

Tyr Pro Phe Thr Ser He Pro Leu Leu Ala Tyr Cys Thr He Pro Ala 
465 470 475 480 

Val Cys Leu Leu Thr Gly Lys Phe He He Pro Thr Leu Ser Asn Leu 
485 490 495 

Thr Ser Val Trp Phe Leu Ala Leu Phe Leu Ser He He Ala Thr Gly 
500 505 510 

Val Leu Glu Leu Arg Trp Ser Gly Val Ser He Gin Asp Trp Trp Arq 
515 520 525 

Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala 
530 535 540 

Val Phe Gin Gly Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe 
545 550 555 560 

Thr Val Thr Ala Lys Ala Ala Asp Asp Thr Glu Phe Gly Glu Leu Tyr 
565 570 575 

Leu Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu He He 
580 585 590 

Leu Asn Met Val Gly Val Val Ala Gly Val Ser Asp Ala He Asn Asn 
595 600 605 

Gly Tyr Gly Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe 
610 615 620 

Trp Val He Leu His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg 
625 630 635 640 

Gin Asn Arg Thr Pro Thr He Val Val Leu Trp Ser He Leu Leu Ala 
645 650 655 

Ser He Phe Ser Leu Val Trp Val Arg He Asp Pro Phe Leu Pro Lys 
660 665 670 

Gin Thr Gly Pro Val Leu Lys Gin Cys Gly Val Glu Cys 
675 680 685 

<210> 26 

<211> 1111 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 26 

Met Ala Ser Thr Pro Pro Gin Thr Ser Lys Lys Val Arg Asn Asn Ser 

1 5 io 15 

Gly Ser Gly Gin Thr Val Lys Phe Ala Arg Arg Thr Ser Ser Gly Arg 
20 25 30 
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Tyr Val Ser Leu Ser Arg Asp Asn He Glu Leu Ser Gly Glu Leu Ser 
35 40 45 

Gly Asp Tyr Ser Asn Tyr Thr Val His He Pro Pro Thr Pro Asp Asn 
50 55 60 

Gin Pro Met Ala Thr Lys Ala Glu Glu Gin Tyr Val Ser Asn Ser Leu 
65 70 75 80 

Phe Thr Gly Gly Phe Asn Ser Val Thr Arg Ala His Leu Met Asp Lys 
85 90 95 

Val He Asp Ser Asp Val Thr His Pro Gin Met Ala Gly Ala Lys Gly 
100 105 no 

Ser Ser Cys Ala Met Pro Ala Cys Asp Gly Asn Val Met Lys Asp Glu 
115 120 125 

Arg Gly Lys Asp Val Met Pro Cys Glu Cys Arg Phe Lys He Cys Arg 
130 135 140 

Asp Cys Phe Met Asp Ala Gin Lys Glu Thr Gly Leu Cys Pro Gly Cys 
145 150 155 " 160 

Lys Glu Gin Tyr Lys He Gly Asp Leu Asp Asp Asp Thr Pro Asp Tyr 
165 170 175 

Ser Ser Gly Ala Leu Pro Leu Pro Ala Pro Gly Lys Asp Gin Arg Gly 
180 185 * 190 

Asn Asn Asn Asn Met Ser Met Met Lys Arg Asn Gin Asn Gly Glu Phe 
195 200 205 

Asp His Asn Arg Trp Leu Phe Glu Thr Gin Gly Thr Tyr Gly Tyr Glv 
210 215 220 

Asn Ala Tyr Trp Pro Gin Asp Glu Met Tyr Gly Asp Asp Met Asp Glu 
225 230 235 * 240 

Gly Met Arg Gly Gly Met Val Glu Thr Ala Asp Lys Pro Trp Arg Pro 
245 250 " 255 

Leu Ser Arg Arg He Pro He Pro Ala Ala He He Ser Pro Tyr Arg 
260 265 270 

Leu Leu He Val He Arg Phe Val Val Leu Cys Phe Phe Leu Thr Trp 
275 280 285 

Arg lie Arg Asn Pro Asn Glu Asp Ala He Trp Leu Trp Leu Met Ser 
290 295 300 



He He Cys Glu Leu Trp Phe Gly Phe Ser Trp He Leu Asp Gin He 

305 310 315 320 

Pro Lys Leu Cys Pro He Asn Arg Ser Thr Asp Leu Glu Val Leu Arg 

325 330 335 

Asp Lys Phe Asp Met Pro Ser Pro Ser Asn Pro Thr Gly Arg Ser Asp 

340 345 350 
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Leu Pro Gly He Asp Leu Phe Val Ser Thr Ala Asp Pro Glu Lys Glu 
355 360 365 

Pro Pro Leu Val Thr Ala Asn Thr He Leu Ser He Leu Ala Val Asp 
370 375 380 

Tyr Pro Val Glu Lys Val Ser Cys Tyr Leu Ser Asp Asp Gly Gly Ala 
385 390 395 400 

Leu Leu Ser Phe Glu Ala Met Ala Glu Ala Ala Ser Phe Ala Asp Leu 
405 410 415 

Trp Val Pro Phe Cys Arg Lys His Asn lie Glu Pro Arg Asn Pro Asp 
420 425 430 

Ser Tyr Phe Ser Leu Lys He Asp Pro Thr Lys Asn Lys Ser Arg He 
435 440 445 

Asp Phe Val Lys Asp Arg Arg Lys He Lys Arg Glu Tyr Asp Glu Phe 
450 455 460 

Lys Val Arg He Asn Gly Leu Pro Asp Ser He Arg Arg Arg Ser Asd 
465 470 475 " 480 

Ala Phe Asn Ala Arg Glu Glu Met Lys Ala Leu Lys Gin Met Arg Glu 
485 490 ' 495 

Ser Gly Gly Asp Pro Thr Glu Pro Val Lys Val Pro Lys Ala Thr Trp 
500 505 510 

Met Ala Asp Gly Thr His Trp Pro Gly Thr Trp Ala Ala Ser Thr Arg 
515 520 525 

Glu His Ser Lys Gly Asp His Ala Gly He Leu Gin Val Met Leu Lys 
530 535 540 

Pro Pro Ser Ser Asp Pro Leu He Gly Asn Ser Asp Asp Lys Val He 
545 550 555 560 

Asp Phe Ser Asp Thr Asp Thr Arg Leu Pro Met Phe Val Tyr Val Ser 
565 570 575 

Arg Glu Lys Arg Pro Gly Tyr Asp His Asn Lys Lys Ala Gly Ala Met 
580 585 590 

Asn Ala Leu Val Arg Ala Ser Ala He Leu Ser Asn Gly Pro Phe He 
595 600 605 

Leu Asn Leu Asp Cys Asp His Tyr He Tyr Asn Cys Lys Ala Val Arg 
610 615 620 



Glu Gly Met Cys Phe Met Met Asp Arg Gly Gly Glu Asp He Cys Tyr 

625 630 635 640 

He Gin Phe Pro Gin Arg Phe Glu Gly He Asp Pro Ser Asp Arg Tyr 

645 650 655 

Ala Asn Asn Asn Thr Val Phe Phe Asp Gly Asn Met Arg Ala Leu Asp 

660 665 670 
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Gly Val Gin Gly Pro Val Tyr Val Gly Thr Gly Thr Met Phe Arg Arg 
675 680 685 

Phe Ala Leu Tyr Gly Phe Asp Pro Pro Asn Pro Asp Lys Leu Leu Glu 
690 695 700 

Lys Lys Glu Ser Glu Thr Glu Ala Leu Thr Thr Ser Asp Phe Asp Pro 
705 710 715 720 

Asp Leu Asp Val Thr Gin Leu Pro Lys Arg Phe Gly Asn Ser Thr Leu 
725 730 735 

Leu Ala Glu Ser He Pro He Ala Glu Phe Gin Gly Arg Pro Leu Ala 
740 745 " 750 

Asp His Pro Ala Val Lys Tyr Gly Arg Pro Pro Gly Ala Leu Arg Val 
755 760 765 

Pro Arg Asp Pro Leu Asp Ala Thr Thr Val Ala Glu Ser Val Ser Val 
770 775 780 

He Ser Cys Trp Tyr Glu Asp Lys Thr Glu Trp Gly Asp Arg Val Gly 
785 790 795 800 

Trp He Tyr Gly Ser Val Thr Glu Asp Val Val Thr Gly Tyr Arg Met 
805 810 815 

His Asn Arg Gly Trp Arg Ser Val Tyr Cys He Thr Lys Arg Asp Ser 
820 825 830 

Phe Arg Gly Ser Ala Pro He Asn Leu Thr Asp Arg Leu His Gin Val 
835 840 845 

Leu Arg Trp Ala Thr Gly Ser Val Glu He Phe Phe Ser Arg Asn Asn 
850 855 860 

Ala He Leu Ala Ser Lys Arg Leu Lys Phe Leu Gin Arg Leu Ala Tyr 
865 870 875 " 880 

Leu Asn Val Gly He Tyr Pro Phe Thr Ser Leu Phe Leu He Leu Tyr 
885 890 895 

Cys Phe Leu Pro Ala Phe Ser Leu Phe Ser Gly Gin Phe He Val Arg 
900 905 910 

Thr Leu Ser He Ser Phe Leu Val Tyr Leu Leu Met He Thr He Cys 
915 920 925 

Leu He Gly Leu Ala Val Leu Glu Val Lys Trp Ser Gly He Gly Leu 
930 935 940 

Glu Glu Trp Trp Arg Asn Glu Gin Trp Trp Leu He Ser Gly Thr Ser 
945 950 955 960 

Ser His Leu Tyr Ala Val Val Gin Gly Val Leu Lys Val He Ala Gly 
965 970 975 



He Glu He Ser Phe Thr Leu Thr Thr Lys Ser Gly Gly Asp Asp Asn 
980 985 " 990 
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Glu Asp lie Tyr Ala Asp Leu Tyr lie Val Lys Trp Ser Ser Leu Met 
995 1000 1005 

lie Pro Pro lie Val lie Ala Met Val Asn He lie Ala He Val Val 
1010 1015 1020 

Ala Phe He Arg Thr He Tyr Gin Ala Val Pro Gin Trp Ser Lys Leu 
1025 1030 1035 1040 

He Gly Gly Ala Phe Phe Ser Phe Trp Val Leu Ala His Leu Tyr Pro 
1045 1050 1055 

Phe Ala Lys Gly Leu Met Gly Arg Arg Gly Lys Thr Pro Thr He Val 
1060 1065 1070 

Phe Val Trp Ala Gly Leu He Ala He Thr He Ser Leu Leu Trp Thr 
1075 1080 1085 

Ala lie Asn Pro Asn Thr Gly Pro Ala Ala Ala Ala Glu Gly Val Gly 
1090 1095 noo 

Gly Gly Gly Phe Gin Phe Pro 
1105 1110 

<210> 27 

<211> 1026 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 27 

Met Glu Ala Ser Ala 

1 5 

Leu Val Val He His 
20 

Asp Gly Gin Phe Cys 

35 

Glu Gly Asp Leu Phe 
50 

Arg Pro Cys Tyr Glu 
65 

Gin Cys Lys Thr Arg 
85 

Gly Asp Glu Asp Glu 
100 

He Glu His Glu Gin 
115 

Tyr Gly Lys Met Ser 
130 

Arg Phe Pro Pro Val 
145 



Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
10 15 

Asn His Glu Glu Pro Lys Pro Leu Lys Asn Leu 
25 30 

Glu He Cys Gly Asp Gin He Gly Leu Thr Val 
40 45 

Val Ala Cys Asn Glu Cys Gly Phe Pro Ala Cys 
55 60 

Tyr Glu Arg Arg Glu Gly Thr Gin Asn Cys Pro 
70 75 ~ 80 

Tyr Lys Arg Leu Arg Gly Ser Pro Arg Val Glu 
90 95 

Glu Asp He Asp Asp He Glu Tyr Glu Phe Asn 
105 no 

Asp Lys His Lys His Ser Ala Glu Ala Met Leu 
120 125 

Tyr Gly Arg Gly Pro Glu Asp Asp Glu Asn Gly 
135 140 



He Ala Gly Gly His Ser Gly Glu Phe Pro Val 
150 155 160 
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Gly Gly Gly Tyr Gly Asn Gly Glu His Gly Leu His Lys Arg Val His 
165 170 175 

Pro Tyr Pro Ser Ser Glu Ala Gly Ser Glu Gly Gly Trp Arg Glu Arg 
180 185 190 

Met Asp Asp Trp Lys Leu Gin His Gly Asn Leu Gly Pro Glu Pro Asp 
195 200 205 

Asp Asp Pro Glu Met Gly Leu He Asp Glu Ala Arg Gin Pro Leu Ser 
210 215 220 

Arg Lys Val Pro He Ala Ser Ser Lys He Asn Pro Tyr Arg Met Val 
225 230 235 - 240 

He Val Ala Arg Leu Val He Leu Ala Val Phe Leu Arg Tyr Arg Leu 
245 250 255 

Leu Asn Pro Val His Asp Ala Leu Gly Leu Trp Leu Thr Ser Val He 
260 265 270 

Cys Glu He Trp Phe Ala Val Ser Trp He Leu Asp Gin Phe Pro Lys 
275 280 285 

Trp Phe Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg 
290 295 300 

Tyr Glu Arg Glu Gly Glu Pro Asn Met Leu Ala Pro Val Asp Val Phe 
305 310 315 320 

Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val Thr Ser Asn 
325 330 335 

Thr Val Leu Ser He Leu Ala Met Asp Tyr Pro Val Glu Lys He Ser 
340 345 350 

Cys Tyr Val Ser Asp Asp Gly Ala Ser Met Leu Thr Phe Glu Ser Leu 
355 360 365 

Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lys 
370 375 380 

Phe Ser He Glu Pro Arg Ala Pro Glu Met Tyr Phe Thr Leu Lys Val 
385 390 395 400 

Asp Tyr Leu Gin Asp Lys Val His Pro Thr Phe Val Lys Glu Arg Arg 
405 410 * 415 

Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Gin 
420 425 430 

Val Ala Lys Ala Ser Lys Val Pro Leu Glu Gly Trp He Met Gin Asp 
435 440 445 

Gly Thr Pro Trp Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met He 
450 455 460 



Gin Val Phe Leu Gly His Ser Gly Gly Phe Asp Val Glu Gly His Glu 
465 470 475 480 
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Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gin 
485 490 495 

His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ala Gly 
500 505 510 

Val Leu Thr Asn Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr 
515 520 525 

Val Asn Asn Ser Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp 
530 535 540 

Pro Gin He Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe 
545 550 555 560 

Asp Gly He Asp Thr Asn Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe 
565 570 575 

Phe Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro Val Tyr 
580 585 590 

Val Gly Thr Gly Cys Val Phe Lys Arg Gin Ala Leu Tyr Gly Tyr Glu 
595 600 605 

Pro Pro Lys Gly Pro Lys Arg Pro Lys Met He Ser Cys Gly Cys Cys 
610 615 620 

Pro Cys Phe Gly Arg Arg Arg Lys Asn Lys Lys Phe Ser Lys Asn Asp 
625 630 635 640 

Met Asn Gly Asp Val Ala Ala Leu Gly Gly Ala Glu Gly Asp Lys Glu 
645 650 655 

His Leu Met Phe Glu Met Asn Phe Glu Lys Thr Phe Gly Gin Ser Ser 
660 665 670 

He Phe Val Thr Ser Thr Leu Met Glu Glu Gly Gly Val Pro Pro Ser 
675 680 ~ 685 

Ser Ser Pro Ala Val Leu Leu Lys Glu Ala He His Val He Ser Cys 
690 695 700 

Gly Tyr Glu Asp Lys Thr Glu Trp Gly Thr Glu Leu Gly Trp He Tyr 
705 710 715 720 

Gly Ser He Thr Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg 
725 730 735 

Gly Trp Arg Ser He Tyr Cys Met Pro Lys Arg Pro Ala Phe Lys Gly 
740 745 750 



Ser Ala Pro He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp 
755 760 765 

Ala Leu Gly Ser Val Glu He Phe Phe Ser Arg His Ser Pro Leu Trp 
770 775 780 

Tyr Gly Tyr Lys Gly Gly Lys Leu Lys Trp Leu Glu Arg Phe Ala Tyr 
785 790 795 800 



50 



WO 00/04166 



PCT/US99/15871 



Ala Asn Thr Thr lie Tyr Pro Phe Thr Ser lie Pro Leu Leu Ala Tyr 
805 810 815 

Cys He Leu Pro Ala He Cys Leu Leu Thr Asp Lys Phe He Met Pro 
820 825 830 

Pro He Ser Thr Phe Ala Ser Leu Phe Phe He Ser Leu Phe Met Ser 
835 840 845 

He He Val Thr Gly He Leu Glu Leu Arg Trp Ser Gly Val Ser He 
850 855 860 

Glu Glu Trp Trp Arg Asn Glu Gin Phe Trp Val He Gly Gly He Ser 
865 870 875 880 

Ala His Leu Phe Ala Val Val Gin Gly Leu Leu Lys He Leu Ala Gly 
885 890 895 

He Asp Thr Asn Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Asp Asp 
900 905 910 

Phe Gly Glu Leu Tyr Ala Phe Lys Trp Thr Thr Leu Leu He Pro Pro 
915 920 925 

Thr Thr Val Leu He He Asn He Val Gly Val Val Ala Gly He Ser 
930 935 940 

Asp Ala He Asn Asn Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys 
945 950 955 960 

Leu Phe Phe Ser Phe Trp Val He Val His Leu Tyr Pro Phe Leu Lys 
965 970 975 

Gly Leu Met Gly Arg Gin Asn Arg Thr Pro Thr He Val Val He Trp 
980 985 990 

Ser Val Leu Leu Ala Ser He Phe Ser Leu Leu Trp Val Arg He Asp 
995 1000 1005 

Pro Phe Val Leu Lys Thr Lys Gly Pro Asp Thr Ser Lys Cys Gly He 
1010 1015 1020 

Asn Cys 
1025 

<210> 28 
<211> 701 
<212> PRT 

<213> Gossypium hirsutum 
<400> 28 

Asp Tyr Pro Val Glu Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala 
1 5 10 15 

Ala Met Leu Thr Phe Glu Ala Leu Ser Glu Thr Ser Glu Phe Ala Arg 
20 25 30 

Lys Trp Val Pro Phe Cys Lys Lys Tyr Asn He Glu Pro Arg Ala Pro 
35 40 45 
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Glu Trp Tyr Phe Ala Gin Lys lie Asp Tyr Leu Lys Asp Lys Val Gin 
50 55 60 

Thr Ser Phe Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu 
55 70 75 " 80 

Phe Lys Val Arg Val Asn Gly Leu Val Ala Lys Ala Gin Lys Val Pro 
85 90 ' 95 

Glu Glu Gly Trp lie Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn 
100 105 no 

Thr Arg Asp His Pro Gly Met He Gin Val Phe Leu Gly Gin Ser Gly 
115 120 125 

Gly Leu Asp Ala Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser 
130 135 140 

Arg Glu Lys Arg Pro Gly Phe Gin His His Lys Lys Ala Gly Ala Met 
145 150 155 J 160 

Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Gly Ala Phe Leu 
165 170 175 

Leu Asn Leu Asp Cys Asp His Tyr He Asn Asn Ser Lys Ala Leu Arg 
180 185 190 

Glu Ala Met Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gin Val Cys 
195 200 205 

Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg Asn Asp Arg 
210 215 ~ 220 

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp He Asn Leu Arg Gly Leu 
225 230 235 240 

Asp Gly He Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn 
245 250 255 

Arg Thr Ala Leu Tyr Gly Tyr Glu Pro Pro Leu Lys Pro Lys His Arg 
260 265 * 270 

Lys Thr Gly He Leu Ser Ser Leu Cys Gly Gly Ser Arg Lys Lys Ser 
275 280 285 

Ser Lys Ser Ser Lys Lys Gly Ser Asp Lys Lys Lys Ser Gly Lys His 
290 295 300 

Val Asp Ser Thr Val Pro Val Phe Asn Leu Glu Asp He Glu Glu Gly 
305 310 315 320 

Val Glu Gly Ala Gly Phe Asp Asp Glu Lys Ser Leu Leu Met Ser Gin 
325 330 335 



Met Ser Leu Glu Lys Arg Phe Gly Gin Ser Ala Val Phe Val Ala Ser 
340 345 350 

Thr Leu Met Glu Asn Gly Gly Val Pro Gin Ser Ala Thr Pro Glu Thr 
355 360 365 
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Leu Leu Lys Glu Ala lie His Val He Ser Cys Gly Tyr Glu Asp Lys 
370 375 380 

Thr Asp Trp Gly Ser Glu He Gly Trp He Tyr Gly Ser Val Thr Glu 
385 390 395 400 

Asp He Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser He 
405 410 " 415 

Tyr Cys Met Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn 
420 425 430 

Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly Ser Val 
435 440 445 

Glu He Leu Phe Ser Arg His Cys Pro He Trp Tyr Gly Tyr Ser Gly 
450 455 460 

Arg Leu Lys Trp Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr He Tyr 
465 470 475 480 

Pro Val Thr Ala He Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val 
485 490 495 

Cys Leu Leu Thr Asn Lys Phe He He Pro Gin He Ser Asn Leu Ala 
500 505 510 

Ser He Trp Phe He Ser Leu Phe Leu Ser lie Phe Ala Thr Gly He 
515 520 525 

Leu Lys Met Lys Trp Asn Gly Val Gly He Asp Gin Trp Trp Arg Asn 
530 535 540 

Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala Val 
545 550 555 560 

Phe Gin Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr 
565 570 575 

Val Thr Ser Lys Ala Ser Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr 
580 585 590 

Met Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu He 
595 600 605 

He Asn Leu Val Gly Val Val Ala Gly He Ser Tyr Val He Asn Ser 
610 615 620 

Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe 
625 630 635 640 

Trp Val He He His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg 
645 650 655 

Gin Asn Arg Thr Pro Thr He Val Val Val Trp Ser lie Leu Leu Ala 
660 665 * 670 



Ser lie Phe Ser Leu Leu Trp Val Arg lie Asp Pro Phe Thr Thr Arg 
675 680 685 
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Val Thr Gly Pro Asp Val Glu Gin Cys Gly lie Asn Cys 
690 695 700 

<210> 29 
<211> 1081 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 29 

Arg Pro Arg Leu He Ala Gly Ser His Asn Arg Asn Glu Phe Val Leu 
15 10 15 

He Asn Ala Asp Glu Asn Ala Arg He Arg Ser Val Gin Glu Leu Ser 
20 25 30 

Gly Gin Thr Cys Gin He Cys Arg Asp Glu He Glu Leu Thr Val Asp 
35 40 45 

Gly Glu Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro Val Cys Arg 
50 55 60 

Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Asn Gin Ala Cys Pro Gin 
65 70 75 ' 80 

Cys Lys Thr Arg Phe Lys Arg Leu Lys Gly Ser Pro Arg Val Glu Gly 
85 90 95 

Asp Glu Glu Glu Asp Asp He Asp Asp Leu Asp Asn Glu Phe Glu Tyr 
100 105 HO 

Gly Asn Asn Gly He Gly Phe Asp Gin Val Ser Glu Gly Met Ser He 
115 120 125 

Ser Arg Arg Asn Ser Gly Phe Pro Gin Ser Asp Leu Asp Ser Ala Pro 
130 135 140 

Pro Gly Ser Gin He Pro Leu Leu Thr Tyr Gly Asp Glu Asp Val Glu 
145 150 155 A " 160 

He Ser Ser Asp Arg His Ala Leu He Val Pro Pro Ser Leu Gly Gly 
165 170 175 

His Gly Asn Arg Val His Pro Val Ser Leu Ser Asp Pro Thr Val Ala 
180 185 190 

Ala His Arg Arg Leu Met Val Pro Gin Lys Asp Leu Ala Val Tyr Gly 
195 200 205 

Tyr Gly Ser Val Ala Trp Lys Asp Arg Met Glu Glu Trp Lys Arg Lys 
210 215 220 

Gin Asn Glu Lys Leu Gin Val Val Arg His Glu Gly Asp Pro Asp Phe 
225 230 235 240 

Glu Asp Gly Asp Asp Ala Asp Phe Pro Met Met Asp Glu Gly Arg Gin 
245 250 255 

Pro Leu Ser Met Lys He Pro He Lys Ser Ser Lys He Asn Pro Tyr 
260 265 270 
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Arg Met Leu lie Val Leu Arg Leu Val lie Leu Gly Leu Phe Phe His 
275 280 285 

Tyr Arg He Leu His Pro Val Lys Asp Ala Tyr Ala Leu Trp Leu He 
290 295 300 

Ser Val He Cys Glu He Trp Phe Ala Val Ser Trp Val Leu Asp Gin 
305 310 315 320 

Phe Pro Lys Trp Tyr Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu 
325 330 335 

Ser Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Gly Leu Ser Pro Val 
340 345 350 

Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu He 
355 360 365 

Thr Ala Asn Thr Val Leu Ser He Leu Ala Val Asp Tyr Pro Val Asp 
370 375 380 

Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe 
385 390 395 400 

Glu Ala Leu Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe 
405 410 ~ ^ ' 415 

Cys Lys Lys Tyr Cys He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Cys 
420 425 430 

His Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro Ala Phe Val Arg 
435 440 445 

Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys Val Lys He 
450 455 460 

Asn Ala Leu Val Ala Thr Ala Gin Lys Val Pro Glu Asp Gly Trp Thr 
465 470 475 " 480 

Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Ser Val Arg Asp His Pro 
485 490 ^ 495 

Gly Met He Gin Val Phe Leu Gly Ser Asp Gly Val Arg Asp Val Glu 
500 505 510 

Asn Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro 
515 520 525 

Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ser Leu He Arg 
530 535 540 

Val Ser Gly Val Leu Ser Asn Ala Pro Tyr Leu Leu Asn Val Asp Cys 
545 550 555 560 

Asp His Tyr He Asn Asn Ser Lys Ala Leu Arg Glu Ala Met Cys Phe 
565 570 575 



Met Met Asp Pro Gin Ser Gly Lys Lys He Cys Tyr Val Gin Phe Pro 
580 585 " 590 
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Gin Arg Phe Asp Gly He Asp Arg His Asp Arg Tyr Ser Asn Arg Asn 
595 600 605 

Val Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly Leu Gin Gly 
610 615 620 

Pro He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr 
625 630 635 640 

Gly Phe Asp Ala Pro Lys Lys Lys Lys Gly Pro Arg Lys Thr Cys Asn 
645 650 " " 655 

Cys Trp Pro Lys Trp Cys Leu Leu Cys Phe Gly Ser Arg Lys Asn Arg 
660 665 670 

Lys Ala Lys Thr Val Ala Ala Asp Lys Lys Lys Lys Asn Arg Glu Ala 
675 680 685 

Ser Lys Gin He His Ala Leu Glu Asn He Glu Glu Gly Arg Gly His 
690 695 700 

Lys Val Leu Asn Val Glu Gin Ser Thr Glu Ala Met Gin Met Lys Leu 
705 710 715 720 

Gin Lys Lys Tyr Gly Gin Ser Pro Val Phe Val Ala Ser Ala Arg Leu 
725 730 735 

Glu Asn Gly Gly Met Ala Arg Asn Ala Ser Pro Ala Cys Leu Leu Lys 
740 745 750 

Glu Ala He Gin Val He Ser Arg Gly Tyr Glu Asp Lys Thr Glu Trp 
755 760 765 

Gly Lys Glu lie Gly Trp He Tyr Gly Ser Val Thr Glu Asp He Leu 
770 775 780 

Thr Gly Ser Lys Met His Ser His Gly Trp Arg His Val Tyr Cys Thr 
785 790 795 800 

Pro Lys Leu Ala Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp 
805 810 815 

Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Phe 
820 825 830 

Leu Ser Arg His Cys Pro He Trp Tyr Gly Tyr Gly Gly Gly Leu Lys 
835 840 845 

Trp Leu Glu Arg Leu Ser Tyr He Asn Ser Val Val Tyr Pro Trp Thr 
850 855 860 

Ser Leu Pro Leu He Val Tyr Cys Ser Leu Pro Ala He Cys Leu Leu 
865 870 875 880 

Thr Gly Lys Phe He Val Pro Glu He Ser Asn Tyr Ala Ser He Leu 
885 890 895 



Phe Met Ala Leu Phe Ser Ser He Ala He Thr Gly He Leu Glu Met 
900 905 ' 910 
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Gin Trp Gly Lys Val Gly lie Asp Asp Trp Trp Arg Asn Glu Gin Phe 
915 920 925 

Trp Val lie Gly Gly Val Ser Ala His Leu Phe Ala Leu Phe Gin Gly 
930 935 940 

Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe Thr Val Thr Ser 
945 950 955 960 

Lys Ala Ala Asp Asp Gly Glu Phe Ser Asp Leu Tyr Leu Phe Lys Trp 
965 970 ^ 975 

Thr Ser Leu Leu He Pro Pro Met Thr Leu Leu He He Asn Val He 
980 985 990 

Gly Val He Val Gly Val Ser Asp Ala He Ser Asn Gly Tyr Asp Ser 
995 1000 1005 

Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val He He 
1010 1015 1020 

His Leu Tyr Pro Phe Leu Lys Gly Leu Leu Gly Lys Gin Asp Arg Met 
1025 1030 1035 • 1040 

Pro Thr He He Val Val Trp Ser He Leu Leu Ala Ser He Leu Thr 
1045 1050 1055 

Leu Leu Trp Val Arg Val Asn Pro Phe Val Ala Lys Gly Gly Pro He 
1060 1065 1070 

Leu Glu He Cys Gly Leu Asp Cys Leu 
1075 1080 
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2. n Claims Nos.: 

beoause they relate to parts of me International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

3. n Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sent enc es of Rule 6.4(a). 

Box li Observations wham unity of Invention ia lacking (Continuation of Item 2 of first shoot) 

This International Searching Authority found multiple inventions in this International application, as follows: 
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I — I searchable claims. 
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restricted to the invention first mentioned in the claims; ft is covered by claims Nos.: 
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This International Searching Authority found multiple (groups*of) 
inventions in this international application, as follows: 

1. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding barley cellulose 
synthase, and corresponding polypeptide as represented by SEQ 
ID N0s:l and 2, fragments encoding amino acid sequence 90% 
identical thereto, transformed hosts containing andmethods 
for obtaining said sequences, methods for altering expression 
and methods for evaluating inhibitors using said sequences. 



2. Claims: 1-6,13-18 all partially and 7-12 all completely 

Nucleic acid fragments encoding com cellulose synthase, and 
corresponding polypeptide as represented by SEQ ID N0s:3-10, 
fragments encoding amino acid sequence 90% identical 
thereto, transformed hosts containing and methods for 
obtaining said sequences, methods for altering expression and 
methods for evaluating inhibitors using said sequences. 



3. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding rice cellulose synthase, and 
corresponding polypeptide as represented by SEQ ID NOsrll 
and 12, fragments encoding amino acid sequence 98% identical 
thereto, transformed hosts containing and methods for 
obtaining said sequences, methods for altering expression and 
methods for evaluating inhibitors using said sequences. 



4. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding soybean cellulose 
synthase, and corresponding polypeptide as represented by SEQ 
ID NOs: 13-18, fragments encoding amino acid sequence 90% 
identical thereto, transformed hosts containing and methods 
for obtaining said sequences, methods for altering expression 
and methods for evaluating inhibitors using said sequences. 



5. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding wheat cellulose synthase, and 
corresponding polypeptide as represented by SEQ ID 
NOs: 19-22, fragments encoding amino acid sequence 90% 
identical thereto, transformed hosts containing and methods 
for obtaining said sequences, methods for altering expression 
and methods for evaluating inhibitors using said sequences. 
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Method for evaluating a conpound for inhibitory acttvty on 
cellulose synthase comparing activity of cellulosfe synthase 
produced in a transformed host with and without the addition 
of the conpound, not covered by any of the previous groups 
of claimed inventions 1-5. 
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